基于多视图立体深度学习的堆叠工件三维重建  

3D Reconstruction of Stacked Workpieces Based on Multi-view Stereo Deep Learning

在线阅读下载全文

作  者:姬田杰 郑飂默[2] 曹克让 王诗宇[2] 周淞杰 JI Tian-Jie;ZHENG Liao-Mo;CAO Ke-Rang;WANG Shi-Yu;ZHOU Song-Jie(College of Computer Science and Technology,Shenyang University of Chemical Technology,Shenyang 110142,China;Shenyang Institute of Computing Technology,Chinese Academy of Sciences,Shenyang 110168,China;Liaoning Provincial Key Laboratory of Intelligent Technology for Chemical Process Industry,Shenyang 110142,China)

机构地区:[1]沈阳化工大学计算机科学与技术学院,沈阳110142 [2]中国科学院沈阳计算技术研究所,沈阳110168 [3]辽宁省化工过程工业智能化技术重点实验室,沈阳110142

出  处:《计算机系统应用》2024年第12期153-160,共8页Computer Systems & Applications

基  金:“兴辽英才计划”产业高端人才项目(XLYC2207077);“兴沈英才计划”高水平技术创新人才项目;沈阳市中青年科技创新人才支持计划(RC210475)。

摘  要:随着工业自动化的不断发展,工件的三维重建技术在制造业中扮演着越来越重要的角色.在实际的工作环境下,工件普遍存在堆叠问题,对后续的机器人识别抓取等工作存在较大影响.目前三维重建技术对于一些具有弱纹理区域的工件重建,仍存在图像特征点提取难度大、特征配准精度低的问题.针对以上问题,本文提出了一种基于多视图立体匹配深度学习的堆叠工件三维重建方法.首先,输入多张不同视角的图像经过融合DCNv2的特征金字塔网络,进行特征提取;然后,进行单应性变换构建代价体,再使用方差聚合为一个统一的代价体;接着在代价体正则化部分,引入SE通道注意力机制模块来提高网络的特征表达能力,增强模型的性能和泛化能力;此方法在DTU(Danish Technical University)数据集上具有较好的表现,并且运用该方法生成的堆叠工件点云模型对以后的工业自动化开展具有重要意义.With the continuous development of industrial automation,the three-dimensional reconstruction technology of workpieces is playing an increasingly important role in the manufacturing industry.In actual working environments,there is a common problem of stacking workpieces,which significantly impacts subsequent work including robot recognition and grasping.Currently,it is hard for 3D reconstruction to extract image feature points and achieve accurate feature registration in workpieces with weak textures.To address the above issues,this study proposes a 3D reconstruction method for stacked workpieces based on deep learning with multi-view stereo matching.Firstly,multiple images from different perspectives are input through a DCNv2-based feature pyramid network for feature extraction.Then,homography transformation is performed to construct cost volumes,and a unified cost volume is obtained through variance aggregation.In the regularization section of the cost volume,an SE channel attention module is introduced to improve the feature expression ability of the network and enhance the performance and generalization ability of the model.This method exhibits good performance on the Danish Technical University(DTU)dataset.The point cloud model of stacked workpieces generated by this method is of great significance for future applications of industrial automation.

关 键 词:多视图三维重建 DCNv2 级联架构 通道注意力机制 

分 类 号:TH126[机械工程—机械设计及理论] TP18[自动化与计算机技术—控制理论与控制工程] TP391.41[自动化与计算机技术—控制科学与工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象