检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王粉花[1,2,3] 赵波 黄超 严由齐 WANG Fenhua;ZHAO Bo;HUANG Chao;YAN Youqi(School of Automation and Electrical Engineering,University of Science and Technology Beijing,Beijing 100083,China;Institute of Artificial Intelligence,University of Science and Technology Beijing,Beijing 100083,China;Beijing Engineering Research Center of Industrial Spectrum Imaginghe,Beijing 100083,China)
机构地区:[1]北京科技大学自动化学院,北京100083 [2]北京科技大学人工智能研究院,北京100083 [3]北京市工业波谱成像工程中心,北京100083
出 处:《电子与信息学报》2020年第12期3045-3052,共8页Journal of Electronics & Information Technology
基 金:国家重点研发计划重点专项(2017YFB1400101-01);北京科技大学中央高校基本科研业务费专项(FRF-BD-19-002A)。
摘 要:行人重识别的关键依赖于行人特征的提取,卷积神经网络具有强大的特征提取以及表达能力。针对不同尺度下可以观察到不同的特征,该文提出一种基于多尺度和注意力网络融合的行人重识别方法(MSAN)。该方法通过对网络不同深度的特征进行采样,将采样的特征融合后对行人进行预测。不同深度的特征图具有不同的表达能力,使网络可以学习到行人身上更加细粒度的特征。同时将注意力模块嵌入到残差网络中,使得网络能更加关注于一些关键信息,增强网络特征学习能力。所提方法在Market1501,DukeMTMC-reID和MSMT17_V1数据集上首位准确率分别到了95.3%,89.8%和82.2%。实验表明,该方法充分利用了网络不同深度的信息和关注的关键信息,使模型具有很强的判别能力,而且所提模型的平均准确率优于大多数先进算法。The key to person re-identification depends on the extraction of pedestrian characteristics.Convolutional neural networks have powerful feature extraction and expression capabilities.In view of the fact that different features can be observed at different scales,a pedestrian re-identification method based on Multi-Scale Attention Network(MSAN)fusion is proposed.This method samples the features at different depths of the network and fuses the sampled features to predict pedestrians.Feature maps of different depths have different expressive powers,enabling the network to learn more fine-grained features of pedestrians.At the same time,the attention module is embedded in the residual network,so that the network can pay more attention to some key information and enhance the network feature learning ability.The accuracy of the proposed method on the datasets such as Market1501,DukeMTMC-reID and MSMT17_V1 reaches 95.3%,89.8%and 82.2%,respectively.Experiments show that the method makes full use of the information of different depths of the network and the key information of interest,so that the model has strong discriminating ability,and the average accuracy of the proposed model is better than most state-of-the-art algorithms.
分 类 号:TN911.73[电子电信—通信与信息系统] TP391[电子电信—信息与通信工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.191.97.68