检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:杨加秀 李新凯 张宏立[1] 王昊 YANG Jiaxiu;LI Xinkai;ZHANG Hongli;WANG Hao(School of Electrical Engineering,Xinjiang University,Urumqi 830017,China)
出 处:《航空学报》2024年第10期243-259,共17页Acta Aeronautica et Astronautica Sinica
基 金:国家自然科学基金(62263030);新疆维吾尔自治区自然科学基金青年科学基金(2022D01C86)。
摘 要:针对系统模型动态不确定的高阶异构无人集群系统在切换通信拓扑下的时变编队控制问题,提出一种基于积分强化学习的最优分布式分层编队控制方法。结合时变编队切换向量构建多四旋翼无人机系统与多无人车系统的增广系统,将异构集群系统的时变编队控制问题转化为镇定问题。引入带折扣因子的价值函数,将异构集群系统的镇定问题转化为最优控制问题。在不破坏一致性分布式编队控制协议的基础上,仅替换反馈增益参数并对其进行取平均操作,以得到整个异构集群的最优时变编队切换控制协议。利用单网络“动作网络-评价网络”结构,结合积分强化学习算法和分布式控制方法,在线实时更新分布式时变编队切换控制器的反馈增益。通过理论证明和仿真实验验证了所设计控制方案的有效性和优越性。To address the problem of time-varying formation control of high-order heterogeneous unmanned cluster systems with uncertain system model dynamics and switching communication topology,an optimal distributed hierarchical formation control method is proposed based on integral reinforcement learning.The time-varying formation control problem for heterogeneous cluster systems is transformed into a stabilization problem by using time-varying formation switching vectors to construct an augmented system of multi-quadrotor Unmanned Aircraft System(UAS)with multi-unmanned vehicle systems.The value function with discount factor is introduced to transform the stabilization problem of the heterogeneous clustered system into an optimal control problem.Only the feedback gain parameters are replaced and averaged to obtain the optimal time-varying formation switching control protocol for the whole heterogeneous cluster without destroying the consistent distributed formation control protocol.The feedback gain of the distributed time-varying formation switching controller is updated online in real time using a single-network“actor networkcritic network”structure,combined with the integral reinforcement learning algorithm and the distributed control method.The effectiveness and superiority of the proposed control scheme are verified by theoretical proof and simulation experiments.
关 键 词:积分强化学习 异构集群 时变编队控制 分布式控制 切换拓扑 最优控制
分 类 号:V249[航空宇航科学与技术—飞行器设计]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.49