基于DPPO的移动采摘机器人避障路径规划及仿真  被引量:12

Obstacle Avoidance Path Planning and Simulation of Mobile Picking Robot Based on DPPO

在线阅读下载全文

作  者:林俊强 王红军[1] 邹湘军[1,2] 张坡 李承恩 周益鹏 姚书杰 Lin Junqiang;Wang Hongjun;Zou Xiangjun;Zhang Po;Li Chengen;Zhou Yipeng;Yao Shujie(College of Engineering,South China Agricultural University,Guangzhou 510642 China;Foshan-Zhongke Innovation Research Institute of Intelligent Agriculture and Robotics,Foshan 528200,China;Maritime Transport College,Ningbo University,Ningbo 315211,China)

机构地区:[1]华南农业大学工程学院,广东广州510642 [2]佛山市中科农业机器人与智慧农业创新研究院,广东佛山528200 [3]宁波大学海运学院,浙江宁波315211

出  处:《系统仿真学报》2023年第8期1692-1704,共13页Journal of System Simulation

基  金:国家自然科学基金(32071912);广东佛山大专项(2120001008424)。

摘  要:针对移动采摘机器人在野外作业过程中面临随机多变的复杂路径环境难以自主决策的难题,提出一种基于深度强化学习的自主避障路径规划方法。设定状态空间和动作空间,借助人工势场法设计奖励函数的思想,提出了一种基于碰撞锥避碰检测的障碍物惩罚系数设定方法,提高自主避碰能力。构建了虚拟仿真系统,使用分布式近端策略优化算法(distributed proximal policy optimization,DPPO)完成了移动采摘机器人的学习训练并进行实验验证。仿真结果表明:本系统能够快速、稳定的控制虚拟移动采摘机器人自主避障,获得更优的作业路径,为采摘机器人自主导航提供理论与技术支撑。Aiming at the autonomous decision-making difficulty of mobile picking robots in random and changeable complicated path environment during field operations,an autonomous obstacle avoidance path planning method based on deep reinforcement learning is propose.By setting the state space and action space and using the artificial potential field method to design the reward function,an obstacle penalty coefficient setting method based on collision cone collision avoidance detection is proposed to improve the autonomous collision avoidance ability.A virtual simulation system is constructed,in which the learning and training of the mobile picking robot is carried out and verified by experiments through the distributed proximal policy optimization(DPPO).Simulation results show that the system can quickly and stably control the virtual mobile picking robot to autonomously avoid obstacles and obtain a better operating path,which can provide theoretical and technical support for the autonomous navigation of picking robot.

关 键 词:深度强化学习 近端策略优化 移动采摘机器人 避障 路径规划 人工势场 碰撞锥 

分 类 号:TP242[自动化与计算机技术—检测技术与自动化装置] TP18[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象