基于合成数据的水下机器人视觉定位方法被引量：2

Visual Localization Method of Autonomous Underwater Vehicle Based on Synthetic Data

作　　者：琚玲周星群胡志强[2,3] 杨翊[2,3] 李黎明白士红 JU Ling;ZHOU Xingqun;HU Zhiqiang;YANG Yi;LI Liming;BAI Shihong(School of Mechanical Engineering,Shenyang Ligong University,Shenyang 110159,China;State Key Laboratory of Robotics,Shenyang Institute of Automation,Chinese Academy of Sciences,Shenyang 110016,China;Institutes for Robotics and Intelligent Manufacturing,Chinese Academy of Sciences,Shenyang 110169,China;University of Chinese Academy of Sciences,Beijing 100049,China)

机构地区：[1]沈阳理工大学机械工程学院,辽宁沈阳110159 [2]中国科学院沈阳自动化研究所机器人学国家重点实验室,辽宁沈阳110016 [3]中国科学院机器人与智能制造创新研究院,辽宁沈阳110169 [4]中国科学院大学,北京100049

出　　处：《信息与控制》2023年第2期129-141,共13页Information and Control

基　　金：中国科学院先导专项(XDC03060201)。

摘　　要：针对水下场景水下机器人(AUV)位姿数据集难以获取、现有的基于深度学习的位姿估计方法无法应用的问题,提出了一种基于合成数据的AUV视觉定位方法。首先基于Unity3D仿真搭建虚拟水下场景,通过虚拟相机获取仿真环境下已知的渲染位姿数据。其次,通过非配对图像转换工作实现渲染图片到真实水下场景下的风格迁移,结合已知渲染图片的位姿信息得到了合成的水下位姿数据集。最后,提出一种基于局部区域关键点投影的卷积神经网络(CNN)位姿估计方法,并基于合成数据训练网络,预测已知参考角点的2维投影,产生2D-3D点对,基于随机一致性采样的Perspective-n-Point(PnP)算法获得相对位置和姿态。本文在渲染数据集以及合成数据集上进行了定量实验,并在真实水下场景进行了定性实验,论证了所提出方法的有效性。实验结果表明,非配对图像转换能够有效消除渲染图像与真实水下图像之间的差距,所提出的局部区域关键点投影方法可以进行更有效的6D位姿估计。The autonomous underwater vehicle(AUV)pose dataset is difficult to obtain in underwater scenarios.In addition,the existing deep learning-based pose estimation methods cannot be applied in this scenario.Thus,this paper proposes an AUV visual localization method based on synthetic data.In this method,we first build a virtual underwater scene by Unity3D and obtain the rendering data of the known pose through the virtual camera.Then,we realize the style transfer of the rendered image to the real underwater scene through the unpaired image translation work.We also obtain the synthetic underwater pose dataset by combining the pose information of the known rendered image.Finally,we propose a convolutional neural network(CNN)pose estimation method based on local region keypoint projections.The CNN is trained using synthetic data to predict 2D projections of known reference corners.The resulting 2D-3D point pairs obtain the relative positions and pose through the Perspective-n-Point algorithm that is based on random sample consensus.The effectiveness of the proposed method is examined using quantitative experiments on rendered datasets and synthetic datasets,as well as qualitative experiments on real underwater scenes.Our experimental results show that the unpaired image translation can effectively eliminate the gap between the rendered image and the real underwater image.We also find that the proposed local area keypoint projection method can perform more effective 6D pose estimation.

关键词：水下机器人位姿估计视觉定位图像生成合成数据深度学习

分类号：TP183[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于合成数据的水下机器人视觉定位方法被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于合成数据的水下机器人视觉定位方法 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于合成数据的水下机器人视觉定位方法被引量：2