一种具有情感和记忆机制的迷宫机器人认知模型  被引量:1

A cognitive model of maze robot with emotion and memory mechanism

在线阅读下载全文

作  者:张晓平 李凯[1] 王力[1] 闫佳庆 何忠贺[1] ZHANG Xiao-ping;LI Kai;WANG Li;YAN Jia-qing;HE Zhong-he(School of Electrical and Control Engineering,North China University of Technology,Beijing 100144,China)

机构地区:[1]北方工业大学电气与控制工程学院,北京100144

出  处:《控制与决策》2023年第10期2850-2858,共9页Control and Decision

基  金:国家自然科学基金项目(61903006);北京市自然科学基金项目(4202022,4204096,4212035);北京市长城学者项目(CIT&TCD 20190304);北京市教委科研项目(KM202010009006,201910009008)。

摘  要:情感作为人类的高级认知,在环境学习和环境理解方面具有重要意义.将情感引入机器人搜索任务,同时结合记忆机理,提出一种具有情感和记忆机制的认知模型,由内部状态、感受器、环境状态系统、情感系统、动态知识库、行为决策系统以及执行器7部分组成.情感系统包含情感生成、情感状态以及情感记忆3个模块,其中,情感记忆用于提供内部奖励.记忆功能在动态知识库中实现.基于强化学习理论框架,将情感内部奖励与记忆进行融合,形成新的奖励机制,并设计相关认知学习算法.以需要“能量补给”的迷宫机器人搜索任务对所提出认知模型进行验证,结果发现,当面对不同情境时,机器人会产生不同的情感.结合前期记忆,机器人所作决策更“拟人”,表明情感和记忆机制设计的有效性.将所提出认知模型、无情感决策认知模型、基于ε-greedy策略的Q学习算法进行对比,结果表明,情感和记忆的引入,能够提高机器人的学习效率,同时学习过程更稳定.As a higher level of human cognition,emotion plays an important role in environment learning and understanding.In this paper,emotion is introduced into the robot search task.By combined with memory mechanism,a cognitive model is proposed,which is composed of seven parts:internal state,receptor,environmental state system,emotion system,dynamic knowledge base,behavior decision system and actuator.In further,the emotion system consists of three modules:Emotion generation,emotion state and emotion memory,where the emotion memory is used to provide internal rewards.The memory function is implemented in the dynamic knowledge base.Based on the theoretical framework of reinforcement learning,the learning algorithm of the cognitive model is designed by integrating emotional internal reward and memory as a new reward mechanism.The maze robot search task requiring“energy replenishment”is tested,and results show that when faced with different situations,the robot can generate different emotions.Combined with the previous memory,the robot’s decision-making seems more“anthropomorphic”,which firstly proves the effectiveness of the design of the emotion and memory mechanism.The cognitive model,the cognitive model without emotion and the Q-learning algorithm based on the ε-greedy strategy are compared,and results show that with the mechanism of emotion and memory,the robot can learn faster,and at the same time,its learning process is more stable.

关 键 词:情感 记忆 内部奖励 迷宫机器人 认知模型 

分 类 号:TP242[自动化与计算机技术—检测技术与自动化装置]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象