检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Cuijuan ZHANG Lianghao JI Shasha YANG Xing GUO Huaqing LI
机构地区:[1]Chongqing Key Laboratory of Image Cognition,Chongqing University of Posts and Telecommunications,Chongqing 400065,China [2]Key Laboratory of Intelligent Perception and Computing of Anhui Province,Anqing Normal University,Anqing 246133,China [3]College of Electronic and Information Engineering,Southwest University,Chongqing 400715,China
出 处:《Science China(Information Sciences)》2025年第1期303-318,共16页中国科学(信息科学)(英文版)
基 金:supported in part by National Natural Science Foundation of China(Grant Nos.62276036,62221005,62006031);Major Project of Scientific and Technological Research Program of Chongqing Municipal Education Commission(Grant No.KJZD-M202100602);Project of Natural Science Foundation of Chongqing(Grant Nos.cstc2021jcyj-msxmX1043,CSTB2024NSCQ-LZX0118);Anhui Provincial Research Programming Project(Grant Nos.2022AH051039,2022AH051054);Doctoral Talent Training Project of Chongqing University of Posts and Telecommunications(Grant No.BYJS202210).
摘 要:This study uses event-triggered(ET)and reinforcement learning methods to investigate the optimal consensus control problem for cooperative-competitive multiagent systems.It proposes a novel distributed ET control strategy,which relies on a prioritized experience replay(PER)policy.This strategy not only conserves communication resources but also ensures acceptable system performance.To implement the proposed method,actor-critic(AC)dual-structured neural networks(NNs)are used to approximate the value function and control policy.In the AC NNs,the weight estimates for the NNs are updated at the moment of event triggering,resulting in a nonperiodic weight adjustment pattern.This approach decreases the computational cost in comparison with the traditional ET mechanism.The PER-based ET mechanism makes full use of valid historical data and effectively establishes a balance between system performance and communication resource conservation.Moreover,it does not require the following two conditions in most existing studies:(1)requirement of the system dynamics model to be known,and(2)persistent excitation.In addition,Zeno behavior is excluded from this study.Finally,a simulation is conducted to confirm the validity of the suggested approach.
关 键 词:multiagent systems event-triggered mechanism prioritized experience replay dual-structured neural networks optimal consensus control
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.11