基于Boltzamnn机的机器人自主学习算法  被引量:1

Self-learning Algorithm for Robot Based on Boltzamnn Machine

在线阅读下载全文

作  者:任红格[1] 阮晓钢[1] 

机构地区:[1]北京工业大学电子信息与控制工程学院,北京100124

出  处:《北京工业大学学报》2012年第1期60-64,共5页Journal of Beijing University of Technology

基  金:国家'八六三'计划资助项目(2007AA04Z226);国家自然科学基金资助项目(60774077);北京市教委重点资助项目(KZ200810005002)

摘  要:针对两轮机器人自平衡运动控制问题,提出了一种基于Boltzamnn机的Skinner操作条件反射学习机制作为机器人仿生自主学习的算法.该算法利用Boltzamnn机中Metropolis判据平衡Skinner操作条件反射学习中探索和利用的比例,并依据概率取向机制以一定的概率选择最优行为,从而使机器人在未知环境下可获得像人或动物一样的仿生自主学习技能,实现机器人的自平衡运动控制.最后,分别用基于Boltzamnn机的Skinner操作条件反射的学习算法和基于贪婪策略的Skinner操作条件反射的学习算法做了仿真实验并进行了比较.结果表明,基于Boltzamnn机的Skinner操作条件反射的学习算法能使机器人获得较强的运动平衡控制技能和较好的动态性能,体现了机器人的自主学习特性.In view of the self-balancing movement control problem of the two-wheeled robot,a bionic self-learning algorithm of the robot is proposed as a study mechanism of Skinner's operant conditioning reflection based on the Boltzamnn machine.This algorithm uses the Metropolis criterion in Boltzamnn machine to balance in the proportion of the exploration and the exploitation in the study of Skinner's operant conditioning reflection,and chooses the most superior behavior through certain probability depending on the probability tropism mechanism.Thus the robot can obtain the skill of bionic self-learning like the human or the animal under the unknown environment,and realize the self-balancing movement control of the robot.Finally,the simulation experiments were conducted and the Skinner's operant conditioning reflection study algorithms based on the Boltzamnn machine and the greedy strategy were compared,separately.Results show that the Skinner's operant conditioning reflection study algorithm based on the Boltzamnn machine can obtain the stronger movement balancing control skill and the better dynamic performance,and manifest the self-learning characteristics of the robot.

关 键 词:Boltzamnn机 Skinner操作条件反射 贪婪策略 自主学习 两轮机器人 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象