基于深度强化学习的雷达智能决策生成算法被引量：1

Radar Intelligent Decision Generation Algorithm Based on Deep Reinforcement Learning

作　　者：赵家琛张劲东[1] 李梓瑜 ZHAO Jiachen;ZHANG Jindong;LI Ziyu(School of Electronic and Information Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 211100,China)

机构地区：[1]南京航空航天大学电子信息工程学院,南京211100

出　　处：《现代雷达》2022年第12期25-33,共9页Modern Radar

基　　金：国家自然科学基金资助项目(62171220)。

摘　　要：针对雷达系统面临的干扰场景复杂多变、人工设计抗干扰策略性能难以保证以及实时性不高的问题,构建了基于深度强化学习的智能决策生成模型,设计了有针对性的动作集、状态集和奖励函数。同时提出了基于双深度Q网络(DDQN)的决策网络训练算法,用于克服深度Q网络(DQN)算法中目标网络与评估网络相耦合导致Q值的过估计。仿真结果表明:与DQN、Q学习、人工制定策略与遍历策略库等方法相比,文中所设计的智能决策模型和训练方法对干扰的抑制效果好,泛化能力更强,反应时间更快,有效地提升了雷达自主决策能力。In order to solve the problems faced by radar system such as complex jamming scenes, low reliability and bad real-time performance, an intelligent decision generation model is constructed based on Deep Reinforcement Learning, where targeted action set, state set and reward function are designed. After that, a decision network training algorithm based on double deep Q-network is proposed to overcome the problem of Q value over estimation which caused by the coupling of target network and evaluation network in Deep Q-network(DQN). The simulation results show that, compared with DQN, Q learning and traversal algorithm, the intelligent decision model and training method designed in this paper have better interference suppression effect, stronger generalization ability and faster response time, and effectively improve the radar independent decision-making ability.

关键词：雷达智能决策深度强化学习深度Q网络双深度Q网络

分类号：TN972[电子电信—信号与信息处理]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度强化学习的雷达智能决策生成算法被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度强化学习的雷达智能决策生成算法 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于深度强化学习的雷达智能决策生成算法被引量：1