Authors: CHEN Renxiang, ZHU Yuhang, YANG Lixia, HE Jiale, TANG Yubin
Affiliations: [1] School of Mechatronics and Vehicle Engineering, Chongqing Jiaotong University, Chongqing 400074, China; [2] Business and Management College, Chongqing University of Science and Technology, Chongqing 401331, China
Source: Journal of Chinese Inertial Technology, 2024, No. 12, pp. 1250-1257, 1262 (9 pages)
Funding: National Natural Science Foundation of China (51975079); Chongqing Technology Innovation and Application Demonstration Project (cstc2018jscx-msybX0012); Science and Technology Research Program of Chongqing Municipal Education Commission (KJZD-M202200701); Open Fund of the Chongqing Engineering Laboratory for Transportation Engineering Application Robots (CELTEAR-KFKT-202002).
Abstract: To address the problem that paths planned for quadruped robots in mountain environments are difficult to traverse when slope is not taken into account, a mountain path planning method based on slope-potential-energy-guided reinforcement learning is proposed. First, the mountain model is partitioned according to a slope classification principle, and the black hole principle is introduced, in light of the terrain characteristics, to improve the artificial potential field (APF) method and construct a global slope potential field, reducing the complexity of the multi-dimensional environment. Second, the potential energy in the field is probability-weighted and fed into the reinforcement learning network to guide early training, accelerating the algorithm's convergence. Finally, a slope optimization method is proposed based on the walking characteristics of quadruped robots: the safe slope range serves as a threshold, and states are adjusted and optimized through a slope reward function. Simulation results show that, compared with the proximal policy optimization (PPO) algorithm and two improved reinforcement learning algorithms, the proposed algorithm converges better, improves the success rate of safely reaching the target point by more than 16.45%, reduces the maximum path slope by more than 34.4%, and plans a stable path with an average slope of 21° to 25°.
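The abstract only outlines the slope-threshold idea behind the potential field and reward shaping; it does not give their exact forms. The Python sketch below is a minimal illustration under assumed forms: the names local_slope_deg, slope_potential, and slope_reward, the 25° safety threshold, and all gain constants are hypothetical placeholders, not the paper's actual formulation.

import numpy as np

SAFE_SLOPE_DEG = 25.0  # assumed upper bound of the safe slope range

def local_slope_deg(height, ix, iy, cell=1.0):
    """Local terrain slope in degrees from a height map, via central differences."""
    h, w = height.shape
    dzdx = (height[iy, min(ix + 1, w - 1)] - height[iy, max(ix - 1, 0)]) / (2.0 * cell)
    dzdy = (height[min(iy + 1, h - 1), ix] - height[max(iy - 1, 0), ix]) / (2.0 * cell)
    return float(np.degrees(np.arctan(np.hypot(dzdx, dzdy))))

def slope_potential(slope_deg, goal_dist, k_att=1.0, k_slope=0.05):
    """Illustrative potential: goal attraction plus a penalty for unsafe slope."""
    return k_att * goal_dist + k_slope * max(0.0, slope_deg - SAFE_SLOPE_DEG) ** 2

def slope_reward(progress, slope_deg, k_pen=0.1):
    """Reward shaping: progress toward the goal, penalized beyond the safe slope."""
    return progress - k_pen * max(0.0, slope_deg - SAFE_SLOPE_DEG)

if __name__ == "__main__":
    # 20 x 20 ramp rising 1 m per cell in x: local slope = atan(1) = 45 degrees
    ramp = np.tile(np.arange(20.0), (20, 1))
    s = local_slope_deg(ramp, 10, 10)
    print(f"slope {s:.1f} deg, potential {slope_potential(s, goal_dist=5.0):.2f}, "
          f"reward {slope_reward(progress=1.0, slope_deg=s):.2f}")

On the 45° ramp, the reward is docked for the 20° excess over the assumed threshold, which is the general mechanism the abstract describes: states within the safe slope range keep their full progress reward, while steeper states are penalized.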
Keywords: mountain environment; path planning; quadruped robot; reinforcement learning; artificial potential field method
Classification: TP242 [Automation and Computer Technology - Detection Technology and Automatic Devices]