基于分层的智能建模方法的多机空战行为建模被引量：1

Research on Multi-aircraft Air Combat Behavior Modeling Based on Hierarchical Intelligent Modeling Methods

作　　者：王宇琨王泽董力维李妮[1] Wang Yukun;Wang Ze;Dong Liwei;Li Ni(School of Automation Science and Electrical Engineering,Beihang University,Beijing 100191,China)

机构地区：[1]北京航空航天大学自动化科学与电气工程学院,北京100191

出　　处：《系统仿真学报》2023年第10期2249-2261,共13页Journal of System Simulation

摘　　要：针对多机空战对抗场景中高维状态-行为空间约束下兵力博弈决策困难的问题,采用基于深度强化学习的兵力智能体决策生成策略,提出面向兵力智能博弈的态势认知和奖励回报生成算法,构建基于混合的智能建模方法的行为建模分层框架。解决了强化学习过程中存在的稀疏奖励技术难点,为解决大规模、多机型、要素多的空战问题提供一种可行的强化学习训练方法。In response to the problem of the difficulty of decision-making in the game of force under the constraints of high-dimensional state-space in multi-machine air combat confrontation scenarios,a force intelligent agent decision-making generation strategy based on deep reinforcement learning is adopted.The developing situational cognition and reward feedback generation algorithms for force intelligent game are proposed,a behavior modeling hierarchical framework based on hybrid intelligence modeling method is constructed,which solve the technical difficulty of sparse reward in the reinforcement learning process.It provides an feasible reinforcement learning training method that can solve the large-scale,multi-model,and multi-element air combat problems.

关键词：作战仿真多智能体深度强化学习非稀疏奖励函数

分类号：TP391.9[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于分层的智能建模方法的多机空战行为建模被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于分层的智能建模方法的多机空战行为建模 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于分层的智能建模方法的多机空战行为建模被引量：1