基于随机博弈与禁忌搜索的网络防御策略选取被引量：6

Selection of Network Defense Strategies Based on Stochastic Game and Tabu Search

作　　者：孙骞薛雷琦[2] 高岭王海[2] 王宇翔 Sun Qian;Xue Leiqi;Gao Ling;Wang Hai;Wang Yuxiang(Contemporary Educational Technology Center,Northwest University,Xi'an 710127;State-Province Joint Engineering and Research Center of Advanced Networking and Intelligent Information Services,School of Information Science and Technology,Northwest University,Xi'an 710127;State-Province Joint Engineering and Research Center of Advanced Networking and Intelligent Information Services,College of Computer Science,Xi'an Polytechnic University,Xi'an 710600)

机构地区：[1]西北大学现代教育技术中心,西安710127 [2]西北大学信息科学与技术学院新型网络智能信息服务国家地方联合工程研究中心,西安710127 [3]西安工程大学计算机科学学院新型网络智能信息服务国家地方联合工程研究中心,西安710600

出　　处：《计算机研究与发展》2020年第4期767-777,共11页Journal of Computer Research and Development

基　　金：国家自然科学基金项目(61572401);赛尔网络下一代互联网技术创新项目(NGⅡ20150403).

摘　　要：网络防御策略是决定网络安全防护效果的关键因素,现有的网络防御决策研究的是完全理性前提条件以及攻防效益函数参数选择等方面,对实际网络攻防中信息不对称、法律惩戒等因素存在模型偏差,降低了策略的实用性与可靠性.结合实际问题,在有限理性的前置条件基础上构建禁忌随机博弈模型,引入了禁忌搜索方法对随机博弈进行有限理性的分析,并设计具有记忆功能的搜索方法,通过禁忌表数据结构实现记忆功能,并利用数据驱动的记忆结合博弈模型得出最优防御策略.实验结果表明:该方法在攻防收益量化方面提高了精准度,防御效益相对于现有典型的方法提高了准确度,方法空间复杂度优于强化学习等典型方法.The network defence strategy is the key factor to determine the effect of network security protection.In terms of the rational precondition of the existing network defence decision-making research and the parameter selection of the attack and defence benefit function,there are model deviations for the factors such as information asymmetry and legal punishment in the actual network attack and defence,which reduces the practicability and reliability of the strategy.In this paper,the Tabu random game model is constructed on the basis of the preconditions of bounded rationality,the Tabu search algorithm is introduced to analyze the bounded rationality of random game,and a search algorithm with memory function is designed.The data structure of the Tabu table is used to realize the memory function,and the data-driven memory combined with the game model is used to get the optimal defence strategy.The experimental results show that this method improves the accuracy in the quantification of attack and defence benefits,improves the accuracy of defence benefits compared with the existing typical methods,and the algorithm space complexity is better than the reinforcement learning and other typical algorithms.

关键词：随机博弈禁忌搜索网络攻防防御策略有限理性

分类号：TP393.08[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于随机博弈与禁忌搜索的网络防御策略选取被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于随机博弈与禁忌搜索的网络防御策略选取 被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于随机博弈与禁忌搜索的网络防御策略选取被引量：6