检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:Zhouzhou Ma Guanghua Gu Wenrui Zhao
机构地区:[1]School of Information Science and Engineering,Yanshan University,Qinhuangdao,066000,China [2]Hebei Key Laboratory of Information Transmission and Signal Processing,Qinhuangdao,066000,China
出 处:《Machine Intelligence Research》2024年第5期966-982,共17页机器智能研究(英文版)
基 金:supported by National Natural Science Foundation of China(No.62072394);Natural Science Foundation of Hebei Province,China(No.F2021203019);Hebei Key Laboratory Project,China(No.202250701010046).
摘 要:Most existing studies on crowd analysis are limited to the level of counting,which cannot provide the exact location of individuals.This paper proposes a self-attention guidance based crowd localization and counting network(SA-CLCN),which can simultaneously locate and count crowds.We take the form of object detection,using the original point annotations of crowd datasets as supervision to train the network.Ultimately,the center point coordinate of each head as well as the number of crowds are predicted.Specifically,to cope with the spatial and positional variations of the crowd,the proposed method introduces transformer to construct a globallocal feature extractor(GLFE)together with the convolutional structure.It establishes the near-to-far dependency between elements so that the global context and local detail features of the crowd image can be extracted simultaneously.Then,this paper designs a pyramid feature fusion module(PFFM)to fuse the global and local information from high level to low level to obtain a multiscale feature representation.In downstream tasks,this paper predicts candidate point offsets and confidence scores by a simple regression header and classification header.In addition,the Hungarian algorithm is used to match the predicted point set and the labelled point set to facilitate the calculation of losses.The proposed network avoids the errors or higher costs associated with using traditional density maps or bounding box annotations.Importantly,we have conducted extensive experiments on several crowd datasets,and the proposed method has produced competitive results in both counting and localization.
关 键 词:Crowd localization crowd counting transformer point supervision object detection
分 类 号:TP39[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:13.58.187.29