基于分配策略优化算法的智能防空任务分配

Intelligent Air Defense Task Assignment Based on Assignment Strategy Optimization Algorithm

作　　者：刘家义王刚[1] 付强[1] 郭相科[1] 王思远 Liu Jiayi;Wang Gang;Fu Qiang;Guo Xiangke;Wang Siyuan(Air and Missile Defense College,Air Force Engineering University,Xi'an 710051,China;Graduate College,Air Force Engineering University,Xi'an 710051,China)

机构地区：[1]空军工程大学防空反导学院,陕西西安710051 [2]空军工程大学研究生院,陕西西安710051

出　　处：《系统仿真学报》2023年第8期1705-1716,共12页Journal of System Simulation

基　　金：国家自然科学基金(62106283)。

摘　　要：针对分配策略最优算法在大规模场景中求解速度不足的问题,基于马尔可夫决策过程,将深度强化学习与其相结合,将大规模防空任务分配问题进行智能化求解。根据大规模防空作战特点,利用马尔可夫决策过程对智能体进行建模,构建数字战场仿真环境;设计防空任务分配智能体,通过近端策略优化算法,在数字战场仿真环境中进行训练。以大规模防空对抗任务为例,验证了该方法的可行性和优越性。Aiming at the insufficient solving speed of assignment strategy optimization algorithm in largescale scenarios,deep reinforcement learning is combined with Markov decision process to carry out the intelligent large-scale air defense task assignment.According to the characteristics of large-scale air defense operations,Markov decision process is used to model the agent and a digital battlefield simulation environment is built.Air defense task assignment agent is designed and trained in digital battlefield simulation environment through proximal policy optimization algorithm.The feasibility and advantage of the method are verified by taking a large-scale ground-to-air countermeasure mission as an example.

关键词：分配策略优化算法任务分配马尔可夫决策过程深度强化学习智能体

分类号：TP391.9[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于分配策略优化算法的智能防空任务分配

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于分配策略优化算法的智能防空任务分配

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索