An object detection approach with residual feature fusion and second-order term attention mechanism  

在线阅读下载全文

作  者:Cuijin Li Zhong Qu Shengye Wang 

机构地区:[1]College of Computer Science and Technology,Chongqing University of Posts and Telecommunications,Chongqing,China [2]College of Electronic Information,Chongqing Institute of Engineering,Chongqing,China

出  处:《CAAI Transactions on Intelligence Technology》2024年第2期411-424,共14页智能技术学报(英文)

基  金:Doctoral Talent Training Project of Chongqing University of Posts and Telecommunications,Grant/Award Number:BYJS202007;Natural Science Foundation of Chongqing,Grant/Award Number:cstc2021jcyj-msxmX0941;National Natural Science Foundation of China,Grant/Award Number:62176034;Scientific and Technological Research Program of Chongqing Municipal Education Commission,Grant/Award Number:KJQN202101901。

摘  要:Automatically detecting and locating remote occlusion small objects from the images of complex traffic environments is a valuable and challenging research.Since the boundary box location is not sufficiently accurate and it is difficult to distinguish overlapping and occluded objects,the authors propose a network model with a second-order term attention mechanism and occlusion loss.First,the backbone network is built on CSPDarkNet53.Then a method is designed for the feature extraction network based on an item-wise attention mechanism,which uses the filtered weighted feature vector to replace the original residual fusion and adds a second-order term to reduce the information loss in the process of fusion and accelerate the convergence of the model.Finally,an objected occlusion regression loss function is studied to reduce the problems of missed detections caused by dense objects.Sufficient experimental results demonstrate that the authors’method achieved state-of-the-art performance without reducing the detection speed.The mAP@.5 of the method is 85.8%on the Foggy_cityscapes dataset and the mAP@.5 of the method is 97.8%on the KITTI dataset.

关 键 词:artificial intelligence computer vision image processing machine learning neural network object recognition 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象