基于深度强化学习的机械臂容错控制方法被引量：9

Fault tolerant control method of manipulator based on deep reinforcement learning

作　　者：李铭浩[1] 张华[1] 刘满禄[1,2] 李新茂[1] 周祺杰 LI Minghao;ZHANG Hua;LIU Manlu;LI Xinmao;ZHOU Qijie(School of Information Engineering,Southwest University of Science and Technology,Mianyang 621000,China;School of Information Science and Technology,University of Science and Technology of China,Hefei 230026,China)

机构地区：[1]西南科技大学信息工程学院,四川绵阳621000 [2]中国科学技术大学信息科学技术学院,安徽合肥230026

出　　处：《传感器与微系统》2020年第1期53-55,59,共4页Transducer and Microsystem Technologies

基　　金：西南科技大学研究生创新基金资助项目(18YCX122);西南科技大学研究生创新基金资助项目(17YCX122);国家“十三五”核能开发项目(20161295)

摘　　要：针对机械臂突发单关节故障的情况,提出一种基于深度强化学习的机械臂容错控制方法。在建立环境模型和奖罚机制的基础上,针对机械臂正常运行和故障运行的情况,使用无模型的强化学习算法进行离线训练。在Rviz中建立机械臂模型并使用上述网络进行在线控制。实验证明:该算法可以有效完成机械臂的正常控制和容错控制。Aiming at the case of sudden single joint failure of the manipulator,a fault-tolerant control method based on deep reinforcement learning is proposed.On the basis of setting up environment model and reward-penalty mechanism,model-free reinforcement learning algorithm is used to train offline in the case of normal operation and failure operation of the manipulator.Mechanical arm model is built in Rviz and online controlled is carried out by using the above network.Experiments show that the algorithm can effectively complete the normal control and fault tolerant control of the manipulator.

关键词：机械臂关节故障容错控制深度神经网络强化学习

分类号：TP242.2[自动化与计算机技术—检测技术与自动化装置] TP212[自动化与计算机技术—控制科学与工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度强化学习的机械臂容错控制方法被引量：9

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于深度强化学习的机械臂容错控制方法 被引量：9

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于深度强化学习的机械臂容错控制方法被引量：9