多Agent协作的强化学习模型和算法 (cited by: 6)

Reinforcement Learning Model and Algorithm Based on Multi-agent Cooperation

Source: Computer Science (《计算机科学》), 2006, No. 12, pp. 156-158, 186 (4 pages in total)

Funding: Supported by the National Natural Science Foundation of China (Grant No. 60573169).

Abstract: The multi-agent cooperative learning process is examined using reinforcement learning techniques, and a new multi-agent cooperative learning model is constructed. Based on this model, a multi-agent cooperative learning algorithm is proposed. The algorithm takes full account of the fact that multiple agents learn together: each agent predicts its action policy from an estimate of the long-term reward of actions and makes decisions accordingly, so that the agents arrive at an optimal joint action policy. Finally, simulation experiments on the hunter-prey pursuit problem verify the convergence of the algorithm and indicate that it is an efficient and fast learning method.

Keywords: cooperative learning; reinforcement learning; multi-agent learning; learning model; learning algorithm

Classification code: TP393 [Automation and Computer Technology: Computer Application Technology]
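
The abstract gives no update rule, so the following is only a minimal sketch of a generic tabular Q-learning update over joint actions. It illustrates what "estimating the long-term reward of actions" and "choosing a joint action policy" could look like for two hunters. It is not the algorithm proposed in the paper. The action set, the state labels, and the parameters `alpha` and `gamma` are all assumptions made for illustration.

```python
import itertools
from collections import defaultdict

# Hypothetical action set for each hunter (not taken from the paper).
ACTIONS = ("up", "down", "left", "right", "stay")
# Joint actions of two hunters: every pair of individual actions.
JOINT_ACTIONS = list(itertools.product(ACTIONS, repeat=2))


def make_q_table():
    """Q-table keyed by (state, joint_action); unseen entries default to 0.0."""
    return defaultdict(float)


def q_update(q, state, joint_action, reward, next_state, alpha=0.1, gamma=0.9):
    """Standard Q-learning update applied to a joint action.

    alpha (learning rate) and gamma (discount factor) are assumed values.
    Returns the updated estimate of the long-term reward.
    """
    best_next = max(q[(next_state, a)] for a in JOINT_ACTIONS)
    td_target = reward + gamma * best_next
    q[(state, joint_action)] += alpha * (td_target - q[(state, joint_action)])
    return q[(state, joint_action)]


def greedy_joint_action(q, state):
    """Pick the joint action with the highest estimated long-term reward."""
    return max(JOINT_ACTIONS, key=lambda a: q[(state, a)])


# Usage: one rewarded transition, then a transition leading into that state.
q = make_q_table()
first = q_update(q, "s0", ("up", "left"), reward=1.0, next_state="s1")
second = q_update(q, "s_prev", ("stay", "stay"), reward=0.0, next_state="s0")
best = greedy_joint_action(q, "s0")
```

In this sketch a single shared table stands in for whatever coordination mechanism the paper's model uses; the discounted value of `s0` propagates back to `s_prev` on the second update, which is the sense in which the estimate is "long-term".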
