Research on Multi-Agent Communication Optimization Method Based on CIDDPG

Cited by: 1

Authors: GENG Junxiang (耿俊香); JIANG Jing (姜静); WEI Shengnan (魏胜楠); DUAN Chang (段昶) (Shenyang Ligong University, Shenyang 110159, China)

Affiliation: [1] School of Automation and Electrical Engineering, Shenyang Ligong University, Shenyang 110159

Source: Journal of Shenyang Ligong University (《沈阳理工大学学报》), 2021, No. 4, pp. 29-34 (6 pages)

Abstract: When the agents in a multi-agent system cooperate, a large number of agents makes the game relationships among them complex and prevents correct decisions from being made in time. Efficient communication is an effective means of multi-agent cooperation. A communication-based algorithm for efficient information learning, CIDDPG, is proposed. It establishes a communication mechanism on top of the multi-agent DDPG algorithm so that agents can exchange information. A scheduling module is added to the policy network of the DDPG algorithm to prune useless information and improve communication efficiency. An attention mechanism is introduced into the value network so that each agent selectively attends to information from other agents, allowing interactions such as cooperation and competition to be carried out efficiently in complex environments. Experiments in two different scenarios show that CIDDPG achieves a higher average reward than the other algorithms compared and converges quickly.
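The abstract names two components, a scheduling module that prunes messages and an attention mechanism that weights the remaining ones, but does not give their exact form. The following is only a minimal NumPy sketch of that general idea under stated assumptions: the top-k pruning rule, the scaled dot-product attention, and the names `schedule_messages` and `attend` are all hypothetical choices made for illustration and are not taken from the paper.

```python
import numpy as np

def schedule_messages(messages, weights, k):
    """Keep the k highest-scoring messages and zero out the rest.

    messages: (n_agents, msg_dim) array, one message per agent.
    weights:  (n_agents,) scheduling scores. Here they are given by hand;
              a learned network would normally produce them (assumption).
    k:        number of messages to keep, assumed to satisfy 1 <= k <= n_agents.
    """
    n = len(weights)
    keep = np.zeros(n, dtype=bool)
    keep[np.argsort(weights)[n - k:]] = True      # indices of the k largest scores
    return messages * keep[:, None], keep

def attend(query, keys, values, mask):
    """Scaled dot-product attention restricted to the unmasked messages."""
    if not mask.any():
        raise ValueError("at least one message must be kept")
    scores = keys @ query / np.sqrt(query.shape[0])
    scores = np.where(mask, scores, -np.inf)      # pruned agents get zero weight
    scores = scores - scores[mask].max()          # shift for numerical stability
    alpha = np.exp(scores)
    alpha = alpha / alpha.sum()
    return alpha @ values, alpha

# Toy usage with 4 agents and 3-dimensional messages (all values illustrative).
rng = np.random.default_rng(0)
n_agents, msg_dim = 4, 3
messages = rng.normal(size=(n_agents, msg_dim))
weights = np.array([0.9, 0.1, 0.7, 0.2])
pruned, keep = schedule_messages(messages, weights, k=2)
query = rng.normal(size=msg_dim)
context, alpha = attend(query, pruned, pruned, keep)
```

In this sketch the pruned agents contribute nothing to `context`, and the attention weights in `alpha` are spread only over the agents whose messages were kept. A critic could then take `context` as an extra input alongside its own observation and action, though how CIDDPG wires this up is not stated in the abstract.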

Keywords: multi-agent system; efficient communication; scheduling module; attention mechanism

Classification code: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]
