Authors: ZHANG Binghan (张柄汉)[1], WANG Chen (王琛)[1], PENG Zhaotao (彭兆涛)[1], ZHANG Yizhai (张夷斋)[2], ZHANG Fan (张帆)[2]
Affiliations: [1] School of Engineering and Machinery, Chang'an University, Xi'an 710054, China; [2] School of Astronautics, Northwestern Polytechnic University, Xi'an 710072, China
Source: Journal of Astronautics (《宇航学报》), 2023, No. 12, pp. 1934-1943 (10 pages)
Funding: National Natural Science Foundation of China (62173275, 62222313).
Abstract: To address target adaptability and the complexity of capture-action planning in space non-cooperative target removal tasks, an envelope capture strategy is proposed that combines reinforcement learning with a "multi-arm grouping cooperation" mechanism. First, the physical and kinematic models of the multi-arm capture mechanism are constructed. A reinforcement learning controller is then designed using the soft actor-critic (SAC) algorithm with a pretraining (PT) method, and a reward function based on the "multi-arm grouping cooperation" mechanism is designed to train the optimal capture action. To verify the strategy's high efficiency on single-target operations and high adaptability across multi-target operations, simulation experiments are carried out on a variety of targets. The results show that the obtained capture strategy captures targets of various configurations efficiently and adaptively.
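The abstract's "multi-arm grouping cooperation" reward mechanism could, in spirit, combine a per-arm approach term with a within-group coordination term. The sketch below is purely illustrative and not the paper's actual reward function: the function name, the two terms, and the weights `w_close` and `w_sync` are all assumptions for the sake of a concrete example.

```python
import numpy as np

def grouped_cooperation_reward(arm_tip_dists, group_ids, w_close=1.0, w_sync=0.5):
    """Hypothetical reward sketch (not from the paper): reward each arm for
    closing its tip-to-target distance, and penalize spread of progress
    within each cooperating group of arms."""
    arm_tip_dists = np.asarray(arm_tip_dists, dtype=float)
    group_ids = np.asarray(group_ids)
    # Approach term: smaller average tip-to-target distance -> higher reward.
    r_close = -w_close * arm_tip_dists.mean()
    # Coordination term: arms in the same group should progress together,
    # so penalize the standard deviation of distances inside each group.
    r_sync = 0.0
    for g in np.unique(group_ids):
        d = arm_tip_dists[group_ids == g]
        r_sync -= w_sync * d.std()
    return r_close + r_sync

# Four arms in two groups: synchronized closing scores higher than uneven closing.
r_even = grouped_cooperation_reward([1.0, 1.0, 1.0, 1.0], [0, 0, 1, 1])
r_uneven = grouped_cooperation_reward([0.5, 1.5, 1.0, 1.0], [0, 0, 1, 1])
```

In an SAC training loop, a shaped scalar like this would be returned by the environment at every step; the grouping term is what would push arms in the same group toward synchronized envelope closure rather than one arm racing ahead.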