检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Delin Luo Zihao Fan Ziyi Yang Yang Xu
机构地区:[1]School of Aerospace Engineering,Xiamen University,Xiamen 361102,China [2]School of Civil Aviation,Northwestern Polytechnical University,Xi'an 710072,China
出 处:《Defence Technology(防务技术)》2024年第5期187-197,共11页Defence Technology
基 金:supported in part by the National Key Laboratory of Air-based Information Perception and Fusion and the Aeronautical Science Foundation of China (Grant No. 20220001068001);National Natural Science Foundation of China (Grant No.61673327);Natural Science Basic Research Plan in Shaanxi Province,China (Grant No. 2023-JC-QN-0733);China IndustryUniversity-Research Innovation Foundation (Grant No. 2022IT188)。
摘 要:Aiming at the problem of multi-UAV pursuit-evasion confrontation, a UAV cooperative maneuver method based on an improved multi-agent deep reinforcement learning(MADRL) is proposed. In this method, an improved Comm Net network based on a communication mechanism is introduced into a deep reinforcement learning algorithm to solve the multi-agent problem. A layer of gated recurrent unit(GRU) is added to the actor-network structure to remember historical environmental states. Subsequently,another GRU is designed as a communication channel in the Comm Net core network layer to refine communication information between UAVs. Finally, the simulation results of the algorithm in two sets of scenarios are given, and the results show that the method has good effectiveness and applicability.
关 键 词:Reinforcement learning UAV Maneuver decision GRU Cooperative control
分 类 号:V279[航空宇航科学与技术—飞行器设计] E91[军事]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.147