基于强化学习的冲压发动机飞行器爬升段轨迹优化控制被引量：1

Reinforcement learning based climb trajectory optimal control of ramjet aircraft

作　　者：周国峰严大卫[2] 梁卓 ZHOU Guofeng;YAN Dawei;LIANG Zhuo(College of Aerospace Engineering,Nanjing University of Aeronautics and Astronautics,Nanjing 210016,China;China Academy of Launch Vehicle Technology,Beijing 100076,China)

机构地区：[1]南京航空航天大学航空学院,南京210016 [2]中国运载火箭技术研究院,北京100076

出　　处：《中国惯性技术学报》2022年第1期135-140,共6页Journal of Chinese Inertial Technology

基　　金：装备发展领域基金(41412050)。

摘　　要：冲压发动机飞行器爬升过程中发动机性能随飞行状态时变,且易受动力性能偏差、气动偏差和风干扰的耦合影响,传统的方法难以给出能量最优的爬升段轨迹解。针对该问题,提出了一种基于强化学习的轨迹优化控制方法。首先构建了基于近端策略优化(PPO)的强化学习任务模型,将轨迹优化问题转化为基于状态给出最优动作策略的强化学习问题,提出了对未到达目标区域样本赋予广义距离奖励的方法来解决奖励稀疏性问题;通过在控制器训练中引入初值采样来降低初值敏感性;提出了将线性扩张状态观测器(LESO)与强化学习相结合的方法,通过对干扰进行观测和补偿提升控制器抗干扰能力。仿真结果表明,采用所提出的算法后,终端约束误差缩小了60%,可为复杂环境下的冲压发动机轨迹优化控制提供参考。In the process of ramjet aircraft climbing, the engine performance varies with the flight state, and is susceptible to the coupling effects of power performance, aerodynamic and wind. Therefore, it is difficult to obtain the optimal energy trajectory solution by traditional methods. To solve the problem, a trajectory optimization control method based on reinforcement learning is proposed. Firstly, a reinforcement learning model based on proximal policy optimization(PPO) is constructed, which transforms the trajectory optimization problem into a state-based reinforcement learning problem with optimal action strategy, and a generalized distance reward method is proposed to solve the problem of reward sparsity. The sensitivity of initial value is reduced by introducing initial value sampling in training. A method combining linear extended state observer(LESO) with reinforcement learning is proposed to improve the anti-jamming ability by observing and compensating the interference. Simulation results show that the terminal state accuracy is improved by 60% by using the proposed algorithm, which can provide a reference for ramjet trajectory optimization control in complex environments.

关键词：冲压发动机轨迹优化强化学习线性扩张状态观测器

分类号：V279[航空宇航科学与技术—飞行器设计]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的冲压发动机飞行器爬升段轨迹优化控制被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于强化学习的冲压发动机飞行器爬升段轨迹优化控制 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于强化学习的冲压发动机飞行器爬升段轨迹优化控制被引量：1