Intelligent Air Defense Task Assignment Based on Assignment Strategy Optimization Algorithm


Authors: Liu Jiayi; Wang Gang[1]; Fu Qiang[1]; Guo Xiangke[1]; Wang Siyuan

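Affiliations: [1] Air and Missile Defense College, Air Force Engineering University, Xi'an 710051, Shaanxi, China; [2] Graduate College, Air Force Engineering University, Xi'an 710051, Shaanxi, China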

Source: Journal of System Simulation, 2023, No. 8, pp. 1705-1716 (12 pages)

Funding: National Natural Science Foundation of China (62106283).

Abstract: To address the insufficient solving speed of assignment strategy optimization algorithms in large-scale scenarios, deep reinforcement learning is combined with a Markov decision process to solve the large-scale air defense task assignment problem intelligently. Based on the characteristics of large-scale air defense operations, a Markov decision process is used to model the agent, and a digital battlefield simulation environment is built. An air defense task assignment agent is then designed and trained in this simulation environment with the proximal policy optimization algorithm. The feasibility and advantages of the method are verified using a large-scale ground-to-air countermeasure mission as an example.
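The abstract names proximal policy optimization (PPO) as the training algorithm but gives no formulas or hyperparameters. As a rough illustration only, the sketch below computes the standard PPO clipped surrogate objective over a batch of sampled actions. The function name, the argument names, and the default `clip_eps=0.2` are assumptions made for this example and are not taken from the paper; the authors' actual network, state, action, and reward design are not described here.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective of standard PPO (to be maximized).

    All names and the default clip_eps are illustrative assumptions,
    not values reported in the paper.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated policy and the sampling policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to the interval [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

In a task assignment setting, each sample would presumably correspond to one assignment decision taken in the simulation environment, with the advantage estimating how much better that decision was than the policy's average behavior. The clipping keeps a single update from moving the policy far from the one that collected the data.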

Keywords: assignment strategy optimization algorithm; task assignment; Markov decision process; deep reinforcement learning; agent

Classification number: TP391.9 [Automation and Computer Technology: Computer Application Technology]

 
