融合多尺度特征与全局上下文信息的X光违禁物品检测  被引量:7

Integrated multi-scale features and global context in X-ray detection for prohibited items

在线阅读下载全文

作  者:李晨 张辉[1,2] 张邹铨 车爱博 王耀南[2] Li Chen;Zhang Hui;Zhang Zouquan;Che Aibo;Wang Yaonan(Changsha University of Science and Technology,Changsha 410114,China;Hunan University,Changsha 410082,China)

机构地区:[1]长沙理工大学,长沙410114 [2]湖南大学,长沙410082

出  处:《中国图象图形学报》2022年第10期3043-3057,共15页Journal of Image and Graphics

基  金:国家重点研发计划资助(2018YFB1308200);国家自然科学基金项目(61971071,62027810,62133005);湖南省杰出青年科学基金项目(2021JJ10025);湖南省重点研发计划资助(2021GK4011,2022GK2011);长沙市科技重大专项项目(kh2003026)。

摘  要:目的X光图像违禁物品检测一直是安检领域的一个基础问题,安检违禁物品形式各异,尺度变化大,以及透视性导致大量物体堆放时出现重叠遮挡现象,传统图像处理模型很容易出现漏检误检,召回率低。针对以上问题,提出一种融合多尺度特征与全局上下文信息的特征增强融合网络(feature enhancement fusion network,FEFNet)用于X光违禁物品检测。方法首先针对特征主干网络darknet53,加入空间坐标的注意力机制,将位置信息嵌入到通道注意力中,分别沿两个空间方向聚合特征,增强特征提取器对违禁目标的特征提取能力,抑制背景噪声干扰。然后,将特征提取主干网络输出的特征编码为1维向量,利用自监督二阶融合获取特征空间像素相关性矩阵,进而获取完整的全局上下文信息,为视觉遮挡区域提供全局信息指导。针对违禁物品尺度不一的问题,提出多尺度特征金字塔融合模块,增加一层小感受野预测特征用于提高对小尺度违禁目标的检测能力。最后,通过融合全局上下文特征信息和局部多尺度细节特征解决违禁物品之间的视觉遮挡问题。结果在SIXRay-Lite(security inspection X-ray)数据集上进行训练和验证,并与SSD(single shot detection)、Faster R-CNN、RetinaNet、YOLOv5(you only look once)和ACMNet(asymmetrical convolution multi-view neural network)模型进行了对比实验。结果表明,本文模型在SIXray-Lite数据集上的mAP(mean average precision)达到85.64%,特征增强融合模块和多尺度特征金字塔融合模块较原有模型分别提升了6.73%和5.93%,总体检测精度较原有检测网络提升了11.24%。结论提出的特征增强融合检测模型能够更好地提取显著差异特征,降低背景噪声干扰,提高对多尺度以及小型违禁物品的检测能力。同时利用全局上下文特征信息和多尺度局部特征相结合,有效地缓解了违禁物品之间的视觉遮挡现象,在保证实时性�Objective X-ray image detection is essential for prohibited items in the context of security inspection of those are different types,large-scale changes and most unidentified prohibited items.Traditional image processing models are concerned of to the status of missed and false inspections,resulting in a low model recall rate,and non-ideal analysis in realtime detection.Differentiated from regular optical images,X-ray images tends to the overlapping phenomena derived of a large number of stacked objects.It is challenged to extract effective multiple overlapping objects information for the deep learning models.The multiple overlapping objects are checked as a new object,resulting in poor classification effect and low detection accuracy.Our feature enhancement fusion network(FEFNet)is facilitated to the issue of X-ray detection of prohibited items based on multi-scale features and global context.Method First,the feature enhancement fusion model improves you only look once v3(YOLOv3)’s feature extractor darknet53 through adding a spatial coordinated attention mechanism.The improved feature extractor is called coordinate darknet,which embeds in situ information into the channel attention and aggregates features in two spatial directions.Coordinate darknet can extract more salient and discriminatory information to improve the feature extractor's ability.Specifically,the coordinated attention module is melted into the last four residual stages of the original darknet53,including two pooling modules.To obtain feature vectors in different directions,the width and height of the feature map are pooled adaptively.To obtain attention vectors in different directions,the feature vectors are processed in different directions through the batch normalization layer and activation layer.What’s more,the obtained attention vector is applied to the input feature map to yield the model to the detailed information.Next,our bilinear second-order fusion module extracts global context features.The module encodes the highest-dimensional

关 键 词:违禁品检测 X光图像 特征增强融合 注意力机制 多尺度融合 全局上下文信息 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象