检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Chao Ji Diyuan Liu Wei Gao Shiwu Zhang
机构地区:[1]School of Engineering Science,University of Science and Technology of China,Hefei 230026,China [2]iFLYTEK Co.,Ltd.,Hefei 230088,China
出 处:《Biomimetic Intelligence & Robotics》2025年第1期108-115,共8页仿生智能与机器人(英文)
基 金:supported by the National Natural Science Foundation of China(U21A20119,62103395,and 51975550).
摘 要:The ability of bipedal humanoid robots to walk adaptively on varied terrain is a critical challenge for practical applications,drawing substantial attention from academic and industrial research communities in recent years.Traditional model-based locomotion control methods have high modeling complexity,especially in complex terrain environments,making locomotion stability difficult to ensure.Reinforcement learning offers an end-to-end solution for locomotion control in humanoid robots.This approach typically relies solely on proprioceptive sensing to generate control policies,often resulting in increased robot body collisions during practical applications.Excessive collisions can damage the biped robot hardware,and more critically,the absence of multimodal input,such as vision,limits the robot’s ability to perceive environmental context and adjust its gait trajectory promptly.This lack of multimodal perception also hampers stability and robustness during tasks.In this paper,visual information is added to the locomotion control problem of humanoid robot,and a three-stage multi-objective constraint policy distillation optimization algorithm is innovantly proposed.The expert policies of different terrains to meet the requirements of gait aesthetics are trained through reinforcement learning,and these expert policies are distilled into student through policy distillation.Experimental results demonstrate a significant reduction in collision rates when utilizing a control policy that integrates multimodal perception,especially in challenging terrains like stairs,thresholds,and mixed surfaces.This advancement supports the practical deployment of bipedal humanoid robots.
关 键 词:Bipedal humanoid robot Deep reinforcement learning Multimodal perception
分 类 号:TP242[自动化与计算机技术—检测技术与自动化装置]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.145