A Distributed Reinforcement Learning Guidance Method under Impact Angle Constraints (cited by: 6)

Authors: LI Bohao; AN Xuman; YANG Xiaofei; WU Yunjie [1,2,3]; LI Guofei

Affiliations: [1] State Key Laboratory of Virtual Reality Technology and System, Beihang University, Beijing 100191, China; [2] School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China; [3] Science and Technology on Aircraft Control Laboratory, Beijing 100191, China; [4] School of Astronautics, Northwestern Polytechnical University, Xi'an 710072, China

Source: Journal of Astronautics (《宇航学报》), 2022, No. 8, pp. 1061-1069 (9 pages)

Funding: National Natural Science Foundation of China (62003021); Fundamental Research Funds for the Central Universities (D5000210830).

Abstract: To improve a missile's strike effectiveness against a target under an impact angle constraint, a distributed reinforcement learning guidance strategy based on the deep deterministic policy gradient algorithm is proposed. To minimize the impact angle error, a new reward function is designed that drives the line-of-sight angle to converge to its desired value while the field-of-view angle constraint is satisfied. In addition, to enhance the generalization ability of the reinforcement learning model, a distributed exploration strategy is proposed that improves the efficiency of environment exploration during model training. Simulation results verify that the proposed distributed reinforcement learning guidance method achieves precise strikes on the target under a fixed impact angle constraint. Compared with traditional guidance laws, the proposed method yields a smaller impact angle error and faster convergence.

Keywords: missile guidance; reinforcement learning; impact angle; gradient algorithm

Classification number: TJ765 [Ordnance Science and Technology - Weapon Systems and Utilization Engineering]
