Application of the POMDP Model to Multi-robot Environment Exploration  Cited by: 3

Research on Multi-robot Environment Exploration using POMDP

Authors: 孟磊 (MENG Lei); 吴芝亮 (WU Zhiliang); 王轶强 (WANG Yiqiang) (School of Mechanical Engineering, Tianjin University, Tianjin 300354, China)

Affiliation: [1] School of Mechanical Engineering, Tianjin University, Tianjin 300354, China

Source: Mechanical Science and Technology for Aerospace Engineering (《机械科学与技术》), 2022, No. 2, pp. 178-185 (8 pages)

Funding: National Natural Science Foundation of China (51205277).

Abstract: To improve the efficiency and accuracy of multi-robot environment exploration, this paper proposes a path planning method based on the partially observable Markov decision process (POMDP) that controls multiple sensor-equipped robots to explore an environment cooperatively. A POMDP model of the multi-robot environment exploration system is established with information entropy as the reward function, so that the robots keep moving in the direction of maximum information entropy. The robots' belief about the environment uses a non-parametric, sample-based representation and is updated by Bayesian filtering. In simulation experiments, the CO concentration in two environments was explored, and accurate measurements were obtained in both cases; the exploration results agree well with the predesigned environments. Compared with the traditional full-coverage path planning method, the proposed method has advantages in both efficiency and accuracy.
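The loop described in the abstract (sample-based belief, Bayesian update, entropy as reward) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The grid layout, the Gaussian sensor model, the single robot, the one-step greedy move rule, and all function names (`make_belief`, `bayes_update`, `entropy`, `best_neighbor`) are assumptions made for illustration. The paper's POMDP planner and its multi-robot coordination are not reproduced here.

```python
import math
import random


def make_belief(n_samples, low, high, rng):
    """Sample-based belief for one cell: candidate CO values with uniform weights."""
    samples = [rng.uniform(low, high) for _ in range(n_samples)]
    weights = [1.0 / n_samples] * n_samples
    return samples, weights


def bayes_update(samples, weights, z, sigma):
    """Bayesian filter step: reweight each sample by an assumed Gaussian
    likelihood p(z | sample), then normalize."""
    new = [w * math.exp(-0.5 * ((z - s) / sigma) ** 2)
           for s, w in zip(samples, weights)]
    total = sum(new)
    if total == 0.0:
        # Measurement is incompatible with every sample; keep the prior.
        return list(weights)
    return [w / total for w in new]


def entropy(weights):
    """Shannon entropy (in nats) of the normalized sample weights."""
    return -sum(w * math.log(w) for w in weights if w > 0.0)


def best_neighbor(pos, beliefs, rows, cols):
    """Greedy one-step rule: move to the 4-connected neighbor whose belief
    has the highest entropy (the most uncertain cell)."""
    r, c = pos
    candidates = [(r + dr, c + dc)
                  for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1))
                  if 0 <= r + dr < rows and 0 <= c + dc < cols]
    return max(candidates, key=lambda p: entropy(beliefs[p][1]))


# Small demo on a hypothetical 3 x 3 grid.
rng = random.Random(0)
rows, cols = 3, 3
beliefs = {(r, c): make_belief(50, 0.0, 100.0, rng)
           for r in range(rows) for c in range(cols)}

# The robot has already measured cell (0, 1); that belief becomes sharper.
s, w = beliefs[(0, 1)]
beliefs[(0, 1)] = (s, bayes_update(s, w, z=40.0, sigma=5.0))

# From (0, 0) the unmeasured neighbor (1, 0) is the more uncertain one.
next_cell = best_neighbor((0, 0), beliefs, rows, cols)
print(next_cell)
```

In this sketch a measurement lowers the entropy of the measured cell, so the greedy rule steers the robot toward cells it knows least about. A full POMDP solution would instead plan over sequences of actions and observations.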

Keywords: multi-robot; environment exploration; POMDP; Bayesian filtering; path planning

Classification: TP242 [Automation and Computer Technology: Detection Technology and Automatic Equipment]

 
