Robocup半场防守中的一种强化学习算法

A Reinforcement Learning Method for Robocup Soccer Half Field Defense

出　　处：《计算机技术与发展》2008年第1期59-62,共4页Computer Technology and Development

基　　金：国家自然科学基金(50575031)

摘　　要：Robocup仿真比赛是研究多Agent之间协作和对抗理论的优秀平台,提高Agent的防守能力是一个具有挑战性的问题。为制定合理的防守策略,将Robocup比赛中的一个子任务——半场防守任务分解为多个一对一防守任务,采用了基于Markov对策的强化学习方法解决这种零和交互问题,给出了具体的学习算法。将该算法应用到3D仿真球队——大连理工大学梦之翼(Fantasia)球队,在实际比赛过程中取得了良好效果。验证了采用Markov零和对策的强化学习算法在一对一防守中优于手工代码的结论。Robocup soccer simulation is an excellent platform in which colhboration and counterwork among multi - agent are studied. It is a challenging problem to improve agent＇s defense ability. In order to design reasonable defending policy, decompose a subtask, half field defense, into some one- vs-one defense subtask and pose it as a problem of zero-sum Markov games. In this paper, a reinforcement learning method based on Markov game is developed and implemented in 3D simulation soccer team——DUT Fantasia. In real matches, this arithmetic is approved to be efficient and better than manual - coding in one- vs- one defense subtask.

关键词：ROBOCUP 强化学习 MARKOV对策零和对策

分类号：TP242.6[自动化与计算机技术—检测技术与自动化装置]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Robocup半场防守中的一种强化学习算法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

Robocup半场防守中的一种强化学习算法

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索