基于特征校准的双注意力遮挡行人检测器  

Dual attention pedestrian detector for occlusion scenario based on feature calibration

在线阅读下载全文

作  者:汤书苑 周一青[1,2,3,4] 李锦涛 刘畅[1,2,4] 石晶林 TANG Shuyuan;ZHOU Yiqing;LI Jintao;LIU Chang;SHI Jinglin(State Key Laboratory of Processors,Institute of Computing Technology,CAS,Beijing 100190,China;Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China;University of the Chinese Academy of Sciences,Beijing 100049,China;Beijing Key Laboratory of Mobile Computing and Pervasive Device,Beijing 100190,China)

机构地区:[1]中国科学院计算技术研究所处理器芯片全国重点实验室,北京100190 [2]中国科学院计算技术研究所无线通信技术研究中心,北京100190 [3]中国科学院大学,北京100049 [4]移动计算与新型终端北京市重点实验室,北京100190

出  处:《西安电子科技大学学报》2024年第6期25-39,共15页Journal of Xidian University

基  金:国家自然科学基金(U21A20449);江苏省重点研发计划(BE2021013-2)。

摘  要:基于计算机视觉的行人检测技术面临的主要挑战之一是遮挡问题,包括自然环境中物体对行人造成的类间遮挡以及行人与行人之间的类内遮挡。这些交织的遮挡模式限制了行人检测器的性能。为此,在Faster R-CNN标准行人检测框架的基础上,提出了一种基于特征校准的双注意力检测网络。该网络首先通过监督学习生成注意力掩码,用以表征图像中的行人空间特征;然后将掩码与主干特征融合,并结合通道注意力机制,校准行人区域。该方法能够增强行人的可见区域,同时减弱遮挡部分对分类和回归的干扰。此外,提出了一种基于遮挡率的非均匀采样策略,专门针对难例进行采样,帮助网络更有效地学习复杂遮挡模式。实验结果表明,与标准行人检测器相比,所提方法在CityPersons验证集的合理遮挡子集上性能提升了约2.5%。One of the major challenges faced by pedestrian detection technology based on computer vision is the issue of occlusion,including inter-class occlusion caused by objects in the natural environment and intra-class occlusion between pedestrians.These intertwined occlusion patterns limit the performance of pedestrian detectors.To address this problem,this paper proposes a dual-attention detection network based on feature calibration within the standard Faster R-CNN pedestrian detection framework.The network first generates attention masks through supervised learning to represent the spatial features of pedestrians in the image.These masks are then fused with backbone features and combined with a channel attention mechanism to calibrate pedestrian regions.This approach enhances the visibility of pedestrian regions while reducing the impact of occluded parts on classification and regression.Additionally,a non-uniform sampling strategy based on occlusion rates is introduced,targeting hard examples to allow the network to better learn complex occlusion patterns.Experimental results demonstrate that in comparison with standard pedestrian detectors,the proposed method achieves a 2.5%performance improvement on the reasonable occlusion subset of the CityPersons validation dataset.

关 键 词:卷积神经网络 行人检测 双注意力机制 特征校准 难例挖掘 遮挡率 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象