基于D3QN的火力方案优选方法

Optimization Selection Method of Fire Plan Based on D3QN

作　　者：佘维[1,2,3] 岳瀚田钊孔德锋[4] SHE Wei;YUE Han;TIAN Zhao;KONG Defeng(School of Cyber Security and Engineering,Zhengzhou University,Zhengzhou 450000,China;Songshan Laboratory,Zhengzhou,450000,China;Zhengzhou Key Laboratory of Blockchain and Data Intelligence,Zhengzhou 450000,China;Institute of Engineering Protection,IDE,AMS,PLA,Luoyang 471023,China)

机构地区：[1]郑州大学网络空间安全学院,郑州450000 [2]嵩山实验室,郑州450000 [3]郑州市区块链与数据智能重点实验室,郑州450000 [4]军事科学院国防工程研究院工程防护研究所,河南洛阳471023

出　　处：《火力与指挥控制》2024年第8期166-174,共9页Fire Control & Command Control

基　　金：嵩山实验室预研项目(YYYY022022003);河南省重点研发与推广专项基金资助项目(212102310039)。

摘　　要：针对在多类弹药协同攻击地面工事类目标任务中火力方案优选效率低的问题,提出一种基于双层决斗DQN(dueling double deep Q network,D3QN)的火力方案优选方法。该方法将打击过程建模为马尔科夫决策过程(Markov decision processes,MDP),设计其状态空间和动作空间,设计综合奖励函数激励火力方案生成策略优化,使智能体通过强化学习框架对策略进行自主训练。仿真实验结果表明,该方法对地面工事类目标的火力方案进行决策,相较于传统启发式智能算法能够获得较优的火力方案,其计算效率和结果的稳定性相较于传统深度强化学习算法具有更明显的优势。To address the problem of inefficient fire plan optimization in the task of coordinated attack on ground fortification-type targets by multiple types of munitions,a fire plan optimization method based on the Dueling Double Deep Q Network(D3QN)is proposed.The method models the striking process as Markov Decision Processes(MDPs),firstly its state space and action space are designed,then a comprehensive reward function is designed to stimulate the optimization of the fire plan generation strategy,and finally the intelligent body is enabled to train the strategy autonomously through a reinforcement learning framework.The simulation experiment results show that the method can achieve more optimal fire solutions for ground fortification type targets than that of the traditional heuristic intelligence algorithms,and its computational efficiency and stability of results are more obviously advantageous than that of the traditional deep reinforcement learning algorithms.

关键词：深度强化学习深度Q网络 D3QN 组合优化火力方案优选

分类号：TJ015[兵器科学与技术—兵器发射理论与技术] E91[军事]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于D3QN的火力方案优选方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于D3QN的火力方案优选方法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索