检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Longwei Xu Gang Zhang Shi Qiu Xibin Cao
机构地区:[1]Research Center of Satellite Technology,Harbin Institute of Technology,Harbin 150001,PR China
出 处:《Space(Science & Technology)》2023年第1期362-373,共12页空间科学与技术(英文)
基 金:supported in part by the Key Research and Development Plan of Heilongjiang Province under Grant GZ20210120.
摘 要:A reinforcement learning-based approach is proposed to design the multi-impulse rendezvous trajectories in linear relative motions.For the relative motion in elliptical orbits,the relative state propagation is obtained directly from the state transition matrix.This rendezvous problem is constructed as a Markov decision process that reflects the fuel consumption,the transfer time,the relative state,and the dynamical model.An actor-critic algorithm is used to train policy for generating rendezvous maneuvers.The results of the numerical optimization(e.g.,differential evolution)are adopted as the expert data set to accelerate the training process.By deploying a policy network,the multi-impulse rendezvous trajectories can be obtained on board.Moreover,the proposed approach is also applied to generate a feasible solution for many impulses(e.g.,20 impulses),which can be used as an initial value for further optimization.The numerical examples with random initial states show that the proposed method is much faster and has slightly worse performance indexes when compared with the evolutionary algorithm.
关 键 词:optimization process OPTIMAL
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.23.94.64