XCSG在多机器人强化学习中的应用被引量：2

Applications of XCSG in Multi-robot Reinforcement Learning

机构地区：[1]郑州成功财经学院信息工程系,郑州451200 [2]南京理工大学计算机科学与技术学院,南京210094 [3]商丘工学院信息与电子学院,商丘476000

出　　处：《计算机科学》2013年第8期249-251,292,共4页Computer Science

基　　金：国家自然科学基金(90820004)资助

摘　　要：XCS分类器在解决机器人强化学习方面已显示出较强的能力,但在多机器人领域仅局限于MDP环境,只能解决环境空间较小的学习问题。提出了XCSG来解决多机器人的强化学习问题。XCSG建立低维的逼近函数,梯度下降技术利用在线知识建立稳定的逼近函数,使Q-表格一直保持在稳定低维状态。逼近函数Q不仅所需的存储空间更小,而且允许机器人在线对已获得的知识进行归纳一般化。仿真实验表明,XCSG算法很好地解决了多机器人学习空间大、学习速度慢、学习效果不确定等问题。XCS classifier system has been shown to solve machine-learning problems in a competitive way. However, in multi-robot problems,XCS is restricted to solve very small problems modeled by a Markov decision process. In this pa- per a new learning technique XCSG that combines XCS and gradient descent methods was proposed to solve multi-robot machine--learning problems. XCSG builds love-dimensional approximation of the function, and gradient descent tech- niques use on--line knowledge to establish a stable approximation of functions, so that the Q-form has been maintained at a low-dimensional stable state. Approximate of the function not only requires smaller storage space, but also allows the robot online knowledge is summarized on the generalization. Simulation results show that XCSG algorithm solves the multi--robot reinforcement learning in a large space, slow learning, learning uncertainty and other issues.

关键词：强化学习多机器人学习分类器梯度下降法的学习分类器

分类号：TP181[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

XCSG在多机器人强化学习中的应用被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

XCSG在多机器人强化学习中的应用 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

XCSG在多机器人强化学习中的应用被引量：2