检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:琚玲 周星群 胡志强[2,3] 杨翊[2,3] 李黎明 白士红 JU Ling;ZHOU Xingqun;HU Zhiqiang;YANG Yi;LI Liming;BAI Shihong(School of Mechanical Engineering,Shenyang Ligong University,Shenyang 110159,China;State Key Laboratory of Robotics,Shenyang Institute of Automation,Chinese Academy of Sciences,Shenyang 110016,China;Institutes for Robotics and Intelligent Manufacturing,Chinese Academy of Sciences,Shenyang 110169,China;University of Chinese Academy of Sciences,Beijing 100049,China)
机构地区:[1]沈阳理工大学机械工程学院,辽宁沈阳110159 [2]中国科学院沈阳自动化研究所机器人学国家重点实验室,辽宁沈阳110016 [3]中国科学院机器人与智能制造创新研究院,辽宁沈阳110169 [4]中国科学院大学,北京100049
出 处:《信息与控制》2023年第2期129-141,共13页Information and Control
基 金:中国科学院先导专项(XDC03060201)。
摘 要:针对水下场景水下机器人(AUV)位姿数据集难以获取、现有的基于深度学习的位姿估计方法无法应用的问题,提出了一种基于合成数据的AUV视觉定位方法。首先基于Unity3D仿真搭建虚拟水下场景,通过虚拟相机获取仿真环境下已知的渲染位姿数据。其次,通过非配对图像转换工作实现渲染图片到真实水下场景下的风格迁移,结合已知渲染图片的位姿信息得到了合成的水下位姿数据集。最后,提出一种基于局部区域关键点投影的卷积神经网络(CNN)位姿估计方法,并基于合成数据训练网络,预测已知参考角点的2维投影,产生2D-3D点对,基于随机一致性采样的Perspective-n-Point(PnP)算法获得相对位置和姿态。本文在渲染数据集以及合成数据集上进行了定量实验,并在真实水下场景进行了定性实验,论证了所提出方法的有效性。实验结果表明,非配对图像转换能够有效消除渲染图像与真实水下图像之间的差距,所提出的局部区域关键点投影方法可以进行更有效的6D位姿估计。The autonomous underwater vehicle(AUV)pose dataset is difficult to obtain in underwater scenarios.In addition,the existing deep learning-based pose estimation methods cannot be applied in this scenario.Thus,this paper proposes an AUV visual localization method based on synthetic data.In this method,we first build a virtual underwater scene by Unity3D and obtain the rendering data of the known pose through the virtual camera.Then,we realize the style transfer of the rendered image to the real underwater scene through the unpaired image translation work.We also obtain the synthetic underwater pose dataset by combining the pose information of the known rendered image.Finally,we propose a convolutional neural network(CNN)pose estimation method based on local region keypoint projections.The CNN is trained using synthetic data to predict 2D projections of known reference corners.The resulting 2D-3D point pairs obtain the relative positions and pose through the Perspective-n-Point algorithm that is based on random sample consensus.The effectiveness of the proposed method is examined using quantitative experiments on rendered datasets and synthetic datasets,as well as qualitative experiments on real underwater scenes.Our experimental results show that the unpaired image translation can effectively eliminate the gap between the rendered image and the real underwater image.We also find that the proposed local area keypoint projection method can perform more effective 6D pose estimation.
关 键 词:水下机器人 位姿估计 视觉定位 图像生成 合成数据 深度学习
分 类 号:TP183[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.16.160.142