基于强化学习的多智能体协作实现  被引量:2

Multi-agent cooperation based on reinforcement learning

在线阅读下载全文

作  者:陈雪江[1] 杨东勇[1] 

机构地区:[1]浙江工业大学信息工程学院,浙江杭州310032

出  处:《浙江工业大学学报》2004年第5期516-519,572,共5页Journal of Zhejiang University of Technology

基  金:浙江省自然科学基金项目(601078)

摘  要:基于马尔科夫过程的强化学习作为一种在线学习方式,能够很好地应用于单智能体环境中。但是由于强化学习理论的限制,在多智能体系统中马尔科夫过程模型不再适用,因此强化学习不能直接用于多智能体的协作学习问题。本文提出了多智能体协作的两层强化学习方法。该方法主要通过在单个智能体中构筑两层强化学习单元来实现。第一层强化学习单元负责学习智能体的联合任务协作策略,第二层强化学习单元负责学习在本智能体看来是最有效的行动策略。所提出的方法应用于3个智能体协作抬起圆形物体的计算机模拟中,结果表明所提出的方法比采用传统强化学习方法的智能体协作得更好。Reinforcement learning based on Markov decision process is a way of on-line learning, which can be applied to single agent environment. However, due to the theoretical limitation that it assumes that an environment is Markovian, traditional reinforcement learning algorithms cannot be applied directly to multi-agent system. In this paper, a two-layer reinforcement learning method for multi-agent cooperation is presented. The proposed method is realized by adding two-layer reinforcement learning units to every agent. The first layer is for learning global cooperation strategy, and the second layer is for learning efficient action policy in one's own view. An experiment that three agents raise a disk-like object cooperatively has been done. Results show that the cooperative performance with the presented method is better than that using traditional reinforcement learning.

关 键 词:强化学习 多智能体系统 协作策略 马尔科夫过程 单元 在线学习 模型 习作 协作学习 物体 

分 类 号:TP18[自动化与计算机技术—控制理论与控制工程] TP242[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象