Norm-DP模型行人检测优化算法  

Optimized Pedestrian Detection Algorithm for Norm-DPModel

在线阅读下载全文

作  者:柴恩惠 马占飞[1] 智敏[2] CHAI Enhui;MA Zhanfei;ZHI Min(School of Information Science and Technology,Inner Mongolia University of Science and Technology Baotou Teachers􀆳College,Baotou,Inner Mongolia 014030,China;School of Computer Science,Inner Mongolia Normal University,Hohhot 010022,China)

机构地区:[1]内蒙古科技大学包头师范学院信息科学与技术学院,内蒙古包头014030 [2]内蒙古师范大学计算机科学学院,呼和浩特010022

出  处:《计算机科学与探索》2021年第3期545-552,共8页Journal of Frontiers of Computer Science and Technology

基  金:国家自然科学基金(61762071,61163025);内蒙古自治区自然科学基金(2016MS0614,2019MS06037,2018MS06008)。

摘  要:传统深度金字塔模型作为一种有效的行人检测算法备受关注,融合可变形部件模型和卷积神经网络模型,但特征提取部分使用的算法像素区域的大小不同,导致模型之间不能完全融合,在行人数量多、姿势复杂和有遮挡情况时的检测效果不理想。因此,提出一种基于规范化函数的深度金字塔模型(Norm-DP)算法,使用规范化函数融合可变形部件模型和卷积神经网络模型,直接从金字塔特征中提取正负样本,使用隐变量支持向量机进行模型训练,结合柔性非最大抑制(soft-NMS)算法和边界框回归(BBR)算法对定位框进行优化。分别使用INRIA和MS COCO数据集进行实验验证,在行人数量多、姿势复杂和有遮挡情况时,检测精度高于最优的可变形部件模型算法、卷积神经网络算法、深度金字塔模型算法和结合区域选择的卷积神经网络算法。The traditional deep pyramid model attracts much attention as an effective pedestrian detection algorithm.It combines deformable part model and convolutional neural network model.However,the algorithm adopted in the feature extraction section has different pixel area sizes,so the models cannot be fully fused.The detection result is not ideal when it comes to the situation with a large number of pedestrians,complex postures,and occlusions.Therefore,a deep pyramid model algorithm based on normalization function(Norm-DP)is proposed in this paper.This algorithm combines the deformable part model and the convolutional neural network model,which extracts positive and negative samples directly from the pyramid features.Model training is then conducted on a latent variable support vector machine.The positioning frame is optimized through soft-non-maximum suppression(soft-NMS)algorithm and bounding box regression(BBR)algorithm.Experimental verification is performed on INRIA and MS COCO datasets.As a result,the detection accuracy of the proposed algorithm is higher than the optimal deformable part model algorithm,convolutional neural network algorithm,deep pyramid model algorithm and convolutional neural network algorithm combined with region selection in the situation with many pedestrians,complex postures and occlusions.

关 键 词:卷积神经网络(CNN) 可变形部件模型算法 规范化深度金字塔(Norm-DP) 柔性非最大抑制(Soft-NMS) 边界框回归(BBR) 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象