Authors: TENG Qinghua; HUI Junpeng; LI Tianren; YANG Ben
Affiliations: [1] Research & Development Center, China Academy of Launch Vehicle Technology, Beijing 100076, China; [2] Beijing Institute of Space Long March Vehicle, Beijing 100076, China
Source: Astronautical Systems Engineering Technology (《宇航总体技术》), 2025, Issue 2, pp. 26-34 (9 pages)
Abstract: For aircraft trajectories planned online under task constraints such as obstacle avoidance and detour flight, this paper designs an intelligent guidance method based on iLQR, a model-based reinforcement learning approach, in order to improve guidance performance and adapt to rapidly changing complex scenarios by fully exploiting the known information in the aircraft model. Compared with model-free reinforcement learning, model-based reinforcement learning offers better interpretability and is easier to train. In single-aircraft guidance simulations, although the average guidance error of the iLQR method during flight is 28.07% higher than that of the TD3 algorithm, the error at the midcourse-to-terminal handover point is reduced to 12.35% of TD3's, a substantial improvement. In the multi-aircraft formation-keeping problem, the iLQR method tracks far better than TD3: its average error is no more than 22.67% of TD3's, and its maximum error is no more than 15.44% of TD3's.
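The paper's flight-dynamics model and cost design are not reproduced in this record, but the iLQR machinery the abstract names follows a standard backward/forward structure. Below is a minimal, generic sketch (a hypothetical 1D double-integrator reaching the origin, not the paper's vehicle model or guidance cost) showing the quadratic value-function backward pass and the feedback-corrected forward rollout:

```python
import numpy as np

# Hypothetical toy system, NOT the paper's model:
# state x = [position, velocity], control u = acceleration.
dt = 0.1
A = np.array([[1.0, dt], [0.0, 1.0]])   # dynamics Jacobian df/dx
B = np.array([[0.0], [dt]])             # dynamics Jacobian df/du

Q = np.diag([1.0, 0.1])      # running state cost (assumed weights)
R = np.array([[0.01]])       # running control cost
Qf = np.diag([100.0, 10.0])  # terminal cost
T = 50                       # horizon steps
x0 = np.array([5.0, 0.0])    # start 5 m from the target (origin)

def rollout(x0, us, ks=None, Ks=None, xs_ref=None):
    """Simulate forward; optionally apply iLQR feedback corrections."""
    xs, us_new = [x0], []
    for t in range(T):
        u = us[t]
        if ks is not None:
            # feedforward + feedback around the previous trajectory
            u = us[t] + ks[t] + Ks[t] @ (xs[t] - xs_ref[t])
        us_new.append(u)
        xs.append(A @ xs[t] + B @ u)
    return xs, us_new

us = [np.zeros(1) for _ in range(T)]
xs, _ = rollout(x0, us)

for _ in range(10):
    # Backward pass: recurse the quadratic value function V(x).
    Vx, Vxx = Qf @ xs[T], Qf
    ks, Ks = [None] * T, [None] * T
    for t in reversed(range(T)):
        Qx = Q @ xs[t] + A.T @ Vx
        Qu = R @ us[t] + B.T @ Vx
        Qxx = Q + A.T @ Vxx @ A
        Quu = R + B.T @ Vxx @ B
        Qux = B.T @ Vxx @ A
        ks[t] = -np.linalg.solve(Quu, Qu)    # feedforward term
        Ks[t] = -np.linalg.solve(Quu, Qux)   # feedback gain
        Vx = Qx + Ks[t].T @ Quu @ ks[t] + Ks[t].T @ Qu + Qux.T @ ks[t]
        Vxx = Qxx + Ks[t].T @ Quu @ Ks[t] + Ks[t].T @ Qux + Qux.T @ Ks[t]
    # Forward pass: re-simulate with the updated policy.
    xs, us = rollout(x0, us, ks, Ks, xs)

print(np.round(xs[-1], 3))  # terminal state driven toward the origin
```

Because the toy dynamics here are linear and the cost quadratic, the loop converges essentially in one iteration (iLQR reduces to LQR); for a real nonlinear flight model the Jacobians A, B would be re-linearized along each new trajectory.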
Keywords: iLQR algorithm; model-based reinforcement learning; standard trajectory guidance; reinforcement learning guidance; formation keeping
Classification: V448.1 [Aerospace Science and Technology: Aircraft Design]