检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:DONG Yubo CUI Tao ZHOU Yufan SONG Xun ZHU Yue DONG Peng 董玉博;崔涛;周禹帆;宋勋;祝月;董鹏(School of Aeronautics and Astronautics,Shanghai Jiao Tong University,Shanghai,200240,China;Beijing Institute of Electronic System Engineering,Beijing,100854,China)
机构地区:[1]School of Aeronautics and Astronautics,Shanghai Jiao Tong University,Shanghai,200240,China [2]Beijing Institute of Electronic System Engineering,Beijing,100854,China
出 处:《Journal of Shanghai Jiaotong university(Science)》2024年第4期646-655,共10页上海交通大学学报(英文版)
基 金:National Natural Science Foundation of China(Nos.61803260,61673262 and 61175028)。
摘 要:Multi-agent reinforcement learning has recently been applied to solve pursuit problems.However,it suffers from a large number of time steps per training episode,thus always struggling to converge effectively,resulting in low rewards and an inability for agents to learn strategies.This paper proposes a deep reinforcement learning(DRL)training method that employs an ensemble segmented multi-reward function design approach to address the convergence problem mentioned before.The ensemble reward function combines the advantages of two reward functions,which enhances the training effect of agents in long episode.Then,we eliminate the non-monotonic behavior in reward function introduced by the trigonometric functions in the traditional 2D polar coordinates observation representation.Experimental results demonstrate that this method outperforms the traditional single reward function mechanism in the pursuit scenario by enhancing agents’policy scores of the task.These ideas offer a solution to the convergence challenges faced by DRL models in long episode pursuit problems,leading to an improved model training performance.
关 键 词:multi-agent reinforcement learning deep reinforcement learning(DRL) long episode reward function
分 类 号:TP242[自动化与计算机技术—检测技术与自动化装置]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.118.195.19