检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:马淼钒 黄勇 黄渊博 Miaofan Ma;Yong Huang;Yuanbo Huang(School of Mechanical Engineering,University of Shanghai for Science and Technology,Shanghai;Ningbo Industrial Internet Research Institute Co.,Ltd.,Ningbo Zhejiang)
机构地区:[1]上海理工大学机械工程学院,上海 [2]宁波工业互联网研究院有限公司,浙江宁波
出 处:《建模与仿真》2024年第3期3663-3676,共14页Modeling and Simulation
摘 要:针对机械臂运动中和未知环境接触以及建模误差带来的干扰,提出由导纳控制和基于双延迟深度确定性策略梯度算法(Twin Delayed Deep Deterministic Policy Gradient,TD3)的PID控制结合的控制策略。通过导纳控制,将机械臂和环境的接触力矩转化为关节角度以修正期望轨迹,抵消环境带来的扰动。并针对接触力矩测量成本高,提出使用基于卡尔曼滤波的广义动量观测器进行接触力矩的估计。针对内环控制,提出基于TD3的PID位置控制器,TD3是深度确定性策略梯度算法(Deep Deterministic Policy Gradient,DDPG)的改进算法,智能体在不断交互和学习的过程中能够自动调整PID参数,避免了经典PID控制在非线性控制问题中表现较差的缺点。搭建机械臂仿真平台,仿真实验结果表明,所设计的控制策略使机械臂具有良好的抗干扰效果。In order to solve the interference caused by unknown environment contact and modeling error,a control strategy combining admittance control and PID control based on Twin Delayed Deep Deterministic Policy Gradient(TD3)is proposed.The admittance control strategy is used to convert the contact torque between the manipulator and the environment into the joint Angle to correct the expected trajectory and offset the disturbance caused by the environment.In view of the high cost of measuring the contact torque,a generalized momentum observer based on Kalman filter is proposed to estimate the contact torque.For inner loop control,a PID position controller based on TD3 is proposed.TD3 is an improved algorithm based on Deep Deterministic Policy Gradient(DDPG).Agents can automatically adjust PID parameters in the process of continuous interaction and learning.The disadvantages of poor performance of classical PID control in nonlinear control problems are avoided.The simulation platform of the manipulator is built,and the simulation results show that the designed control strategy makes the manipulator have good anti-interference effect.
关 键 词:抗干扰 导纳控制 深度强化学习 TD3 卡尔曼滤波 接触力矩
分 类 号:TP2[自动化与计算机技术—检测技术与自动化装置]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.30