An MFFBSNet crowd counting algorithm based on multi-scale feature fusion and background suppression

Authors: ZHAO Jia-bin; XU Hui-ying [1]; ZHU Rong; CHEN Bin; WANG Xiao-lin; ZHU Xin-zhong [1]

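Affiliations: [1] School of Computer Science and Technology (School of Artificial Intelligence), Zhejiang Normal University, Jinhua 321004, Zhejiang, China; [2] Jiaxing Key Laboratory of Smart Transportations, Jiaxing 314001, Zhejiang, China; [3] College of Information Engineering, Jiaxing Nanhu University, Jiaxing 314001, Zhejiang, China; [4] Jiaxing Key Laboratory of Intelligent Computation and Data Science, Jiaxing 314001, Zhejiang, China; [5] College of Information Science and Engineering, Jiaxing University, Jiaxing 314001, Zhejiang, China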

Source: Computer Engineering & Science, 2024, No. 12, pp. 2205-2214 (10 pages)

Funding: National Natural Science Foundation of China (62376252); Natural Science Foundation of Zhejiang Province (LZ22F030003).

Abstract: To address scale variation, uneven distribution, and background occlusion of dense crowds in complex scenes, a crowd counting algorithm, MFFBSNet, based on multi-scale feature fusion and background suppression is proposed. The first 13 layers of the Visual Geometry Group network VGG-16 serve as the front end of the network. Atrous spatial pyramid pooling (ASPP) and a lightweight pyramid split attention (PSA) mechanism are introduced to build a multi-scale feature fusion module, which addresses scale variation in dense crowds. In the middle of the network, spatial and channel attention mechanisms calibrate the feature maps and highlight the head regions in the image. The back end of the network uses atrous (dilated) convolution, which enlarges the receptive field without reducing image resolution, to generate a background segmentation attention map that suppresses background noise and improves the quality of the crowd density map. Experimental results on three public datasets, ShanghaiTech, UCF_CC_50, and NWPU-Crowd, show that the proposed MFFBSNet-based crowd counting algorithm achieves higher counting accuracy than methods such as MCNN, SwitchCNN, and CSRNet.
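Two ideas from the abstract can be illustrated with a small sketch: atrous (dilated) convolution widens the receptive field while keeping the output resolution, and an attention map can damp background responses in a density map whose sum gives the estimated count. This is not the authors' implementation. The function names and toy values are hypothetical, and the element-wise product is only an assumption about how an attention map is commonly applied, since the abstract does not say how MFFBSNet combines the two maps.

```python
# Illustrative sketch only; names and values are hypothetical,
# not taken from the MFFBSNet paper.

def effective_kernel_size(kernel_size: int, dilation: int) -> int:
    """Span covered by a dilated kernel: k + (k - 1) * (d - 1)."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def conv_output_size(size: int, kernel_size: int, dilation: int,
                     padding: int, stride: int = 1) -> int:
    """Standard convolution output-size formula along one axis."""
    span = effective_kernel_size(kernel_size, dilation)
    return (size + 2 * padding - span) // stride + 1


def suppress_background(density, attention):
    """Element-wise product of a density map and an attention map.

    Assumption: attention values lie in [0, 1], with values near 0
    marking background pixels.
    """
    return [[d * a for d, a in zip(d_row, a_row)]
            for d_row, a_row in zip(density, attention)]


def crowd_count(density) -> float:
    """The estimated count is the sum over the density map."""
    return sum(sum(row) for row in density)


if __name__ == "__main__":
    # A 3x3 kernel with dilation 2 spans 5 pixels; padding 2 keeps a
    # 64-pixel axis at 64 pixels.
    print(effective_kernel_size(3, 2), conv_output_size(64, 3, 2, padding=2))

    density = [[0.5, 0.25], [0.25, 1.0]]   # toy density map
    attention = [[1.0, 0.0], [0.0, 1.0]]   # 1 = crowd, 0 = background
    print(crowd_count(suppress_background(density, attention)))
```

With stride 1, setting the padding equal to the dilation keeps a 3x3 dilated convolution's output the same size as its input, which is the sense in which resolution is preserved.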

Keywords: dense crowd counting; multi-scale fusion; background suppression; density map

Classification code: TP391.41 [Automation and Computer Technology: Computer Application Technology]
