检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:柴恩惠 马占飞[1] 智敏[2] CHAI Enhui;MA Zhanfei;ZHI Min(School of Information Science and Technology,Inner Mongolia University of Science and Technology Baotou TeachersCollege,Baotou,Inner Mongolia 014030,China;School of Computer Science,Inner Mongolia Normal University,Hohhot 010022,China)
机构地区:[1]内蒙古科技大学包头师范学院信息科学与技术学院,内蒙古包头014030 [2]内蒙古师范大学计算机科学学院,呼和浩特010022
出 处:《计算机科学与探索》2021年第3期545-552,共8页Journal of Frontiers of Computer Science and Technology
基 金:国家自然科学基金(61762071,61163025);内蒙古自治区自然科学基金(2016MS0614,2019MS06037,2018MS06008)。
摘 要:传统深度金字塔模型作为一种有效的行人检测算法备受关注,融合可变形部件模型和卷积神经网络模型,但特征提取部分使用的算法像素区域的大小不同,导致模型之间不能完全融合,在行人数量多、姿势复杂和有遮挡情况时的检测效果不理想。因此,提出一种基于规范化函数的深度金字塔模型(Norm-DP)算法,使用规范化函数融合可变形部件模型和卷积神经网络模型,直接从金字塔特征中提取正负样本,使用隐变量支持向量机进行模型训练,结合柔性非最大抑制(soft-NMS)算法和边界框回归(BBR)算法对定位框进行优化。分别使用INRIA和MS COCO数据集进行实验验证,在行人数量多、姿势复杂和有遮挡情况时,检测精度高于最优的可变形部件模型算法、卷积神经网络算法、深度金字塔模型算法和结合区域选择的卷积神经网络算法。The traditional deep pyramid model attracts much attention as an effective pedestrian detection algorithm.It combines deformable part model and convolutional neural network model.However,the algorithm adopted in the feature extraction section has different pixel area sizes,so the models cannot be fully fused.The detection result is not ideal when it comes to the situation with a large number of pedestrians,complex postures,and occlusions.Therefore,a deep pyramid model algorithm based on normalization function(Norm-DP)is proposed in this paper.This algorithm combines the deformable part model and the convolutional neural network model,which extracts positive and negative samples directly from the pyramid features.Model training is then conducted on a latent variable support vector machine.The positioning frame is optimized through soft-non-maximum suppression(soft-NMS)algorithm and bounding box regression(BBR)algorithm.Experimental verification is performed on INRIA and MS COCO datasets.As a result,the detection accuracy of the proposed algorithm is higher than the optimal deformable part model algorithm,convolutional neural network algorithm,deep pyramid model algorithm and convolutional neural network algorithm combined with region selection in the situation with many pedestrians,complex postures and occlusions.
关 键 词:卷积神经网络(CNN) 可变形部件模型算法 规范化深度金字塔(Norm-DP) 柔性非最大抑制(Soft-NMS) 边界框回归(BBR)
分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28