零部件光学影像精准定位的轻量化深度学习网络  被引量:1

Lightweight deep learning network for accurate localization of optical image components

在线阅读下载全文

作  者:牛小明 曾理[1] 杨飞 何光辉[1] NIU Xiaoming;ZENG Li;YANG Fei;HE Guanghui(College of Mathematics and Statistics,Chongqing University,Chongqing 401331,China;Chang Chun Champion Optics Co.,Ltd.,Changchun 130000,China)

机构地区:[1]重庆大学数学与统计学院,重庆401331 [2]长春长光辰谱科技有限公司,吉林长春130000

出  处:《光学精密工程》2023年第17期2611-2625,共15页Optics and Precision Engineering

基  金:国家自然科学基金资助项目(No.62076043);国家重点研发计划资助项目(No.2020YFB2007001)。

摘  要:光学影像精准定位是提高工业生产效率和质量的重要环节。传统图像处理定位方法由于光照、噪声等环境因素的影响,在复杂场景下定位精度低、易受干扰;而经典深度学习网络虽然在自然场景目标检测、工业安检、抓取、缺陷检测等得到了广泛应用,但是其海量数据的训练需求、复杂系统的深度学习大模型、检测框的冗余及不精确等问题,导致它不能直接应用于工业零部件像素级精准定位。针对以上问题,构建了一种零部件光学影像像素级精准定位的轻量化深度学习网络方法。网络总体选用Encoder-Decoder架构,Encoder使用三级bottleneck级联,在降低特征提取参变量的同时充分提升了网络的非线性;Encoder与Decoder对应特征层实施融合拼接,促使Encoder在上采样卷积后可以获得更多的高分辨率信息,进而更完备地重建出原始图像细节信息;最后,利用加权的Hausdorff距离构建了Decoder输出层与定位坐标点的关系。实验结果表明:轻量化深度学习定位网络模型参数为57.4 kB,定位精度小于等于5 pixel的识别率大于等于99.5%,基本满足工业零部件定位精度高、准确率高和抗干扰能力强等要求。Precise optical image localization is crucial for improving industrial production efficiency and quality.Traditional image processing and localization methods have low accuracy and are vulnerable to en⁃vironmental factors such as lighting and noise in complex scenes.Although classical deep learning net⁃works have been widely applied in natural-scene object detection,industrial inspection,grasping,defect detection,and other areas,directly applying pixel-level precise localization to industrial components is still challenging owing to the requirements of massive data training,complex deep learning models,and redun⁃dant and imprecise detection boxes.To address these issues,this paper proposes a lightweight deep learn⁃ing network approach for pixel-level accurate localization of component optical images.The overall design of the network adopts an Encoder–Decoder architecture.The Encoder incorporates a three-level bottle⁃neck cascade to reduce the parameter complexity of feature extraction while enhancing the network’s non⁃linearity.The Encoder and Decoder perform feature layer fusion and concatenation,enabling the Encoder to obtain more high-resolution information after upsampling convolution and to reconstruct the original im⁃age details more comprehensively.Finally,the weighted Hausdorff distance is utilized to establish the rela⁃tionship between the Decoder's output layer and the localization coordinates.Experimental results demon⁃strate that the lightweight deep learning localization network model has a parameter size of 57.4 kB,and the recognition rate for localization accuracy less than or equal to 5 pixels is greater than or equal to 99.5%.Thus,the proposed approach satisfies the requirements of high localization accuracy,high preci⁃sion,and strong anti-interference capabilities for industrial component localization.

关 键 词:机器视觉 光学影像 深度学习 精准定位 轻量化 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象