检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张世辉[1,2] 赵维勃 王磊 王威[1] 李群鹏 ZHANG Shihui;ZHAO Weibo;WANG Lei;WANG Wei;LI Qunpeng(School of Information Science and Engineering,Yanshan University,Qinhuangdao 066004,China;The Key Laboratory for Computer Virtual Technology and System Integration of Hebei Province,Qinhuangdao 066004,China)
机构地区:[1]燕山大学信息科学与工程学院,秦皇岛066004 [2]河北省计算机虚拟技术与系统集成重点实验室,秦皇岛066004
出 处:《电子与信息学报》2023年第6期2236-2245,共10页Journal of Electronics & Information Technology
基 金:中央引导地方科技发展资金项目(216Z0301G);河北省自然科学基金(F2019203285);河北省创新能力提升计划项目(22567626H)。
摘 要:尺度变化、遮挡和复杂背景等因素使得拥挤场景下的人群数量估计成为一项具有挑战性的任务。为了应对人群图像中的尺度变化和现有多列网络中规模限制及特征相似性问题,该文提出一种多尺度交互注意力人群计数网络(Multi-Scale Interactive Attention crowd counting Network,MSIANet)。首先,设计了一个多尺度注意力模块,该模块使用4个具有不同感受野的分支提取不同尺度的特征,并将各分支提取的尺度特征进行交互,同时,使用注意力机制来限制多列网络的特征相似性问题。其次,在多尺度注意力模块的基础上设计了一个语义信息融合模块,该模块将主干网络的不同层次的语义信息进行交互,并将多尺度注意力模块分层堆叠,以充分利用多层语义信息。最后,基于多尺度注意力模块和语义信息融合模块构建了多尺度交互注意力人群计数网络,该网络充分利用多层次语义信息和多尺度信息生成高质量人群密度图。实验结果表明,与现有代表性的人群计数方法相比,该文提出的MSIANet可有效提升人群计数任务的准确性和鲁棒性。Factors such as scale variation,occlusion and complex backgrounds make crowd number estimation in crowded scenes a challenging task.To cope with the scale variation in crowd images and the scope limitation and the feature similarity problem in existing multi-column networks,a Multi-Scale Interactive Attention crowd counting Network(MSIANet)is proposed in this paper.Firstly,a multi-scale attention module is designed,which uses four branches with different perceptual fields to extract features at different scales and interacts the scale features extracted from each branch.At the same time,an attention mechanism is used to limit the feature similarity problem of the multi-column network.Secondly,a semantic information fusion module is designed based on the multi-scale attention module,which interacts different levels of semantic information of the backbone network and stacks the multi-scale attention module in layers to make full use of the multi-layer semantic information.Finally,a multi-scale interactive attention crowd counting network is constructed based on the multi-scale attention module and the semantic information fusion module,which makes full use of multi-level semantic information and multi-scale information to generate high-quality crowd density maps.The experimental results show that compared with the existing representative crowd counting methods,the proposed MSIANet can effectively improve the accuracy and robustness of the crowd counting task.
分 类 号:TN911.73[电子电信—通信与信息系统] TP391.41[电子电信—信息与通信工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.229