基于多视角跨模态的电力现场作业行人重识别网络架构技术研究  被引量:2

Research on Network Structure of Pedestrian Recognition for Power Operation Field Based on Multi-view Cross-modal Image Processing

在线阅读下载全文

作  者:张森 张颉[2] 王尧 刘锦隆 闫斌[4] 尚赵伟[1] Zhang Sen;Zhang Jie;Wang Yao;Liu Jinglong;Yan Bin;Shang Zhaowei(College of Computer Science,Chongqing University,Chongqing 400000,China;State Grid Sichuan Electric Power Company,Chengdu 610041,Sichuan,China;State Grid Liangshan Electric Power Supply Company,Xichang 615000,Sichuan,China;School of Automation Engineering,University of Electronic Science and Technology,Chengdu 611731,Sichuan,China)

机构地区:[1]重庆大学计算机学院,重庆400000 [2]国网四川省电力公司,四川成都610041 [3]国网四川省电力公司凉山供电公司,四川西昌615000 [4]电子科技大学自动化学院,四川成都611731

出  处:《四川电力技术》2020年第6期6-10,15,共6页Sichuan Electric Power Technology

摘  要:可见光到红外光跨模态行人重识别目的是实现在白天和夜间环境下对行人身份的识别判断,在视频监控领域具有重要研究价值。因可见光和红外光成像原理的不同,给跨模态重识别问题带来了挑战。设计了一种新的网络结构,用于缓解模态间数据差异,提高行人重识别模型的精度。网络结构分为两部分:基于注意力的模态迁移模块嵌入特征网络的输入级,可缩小跨模态差异;基于分块的多粒度特征分解模块,同时考虑整体信息和局部信息并了提高有效信息的利用率。在公开数据集SYSU-MM01上,所提方法的累计匹配特性指标的rank1达到了56.45%,平均精确度指标达到了53.52%,比当前最佳方法(XIV,AAAI-2020)分别提高了6.53%和2.79%,有效提高了可见光到红外光跨模态行人重识别的性能。The purpose of cross-modal pedestrian recognition from visible to infrared light is to realize the identification and judgment of pedestrian identity in day and night environments,which is of great research value in the field of video surveillance.Due to the different imaging principles of visible light and infrared light,cross-modal recognition is a challenge.A new network structure is designed to alleviate the data difference between modals and improve the accuracy of pedestrian recognition model.The proposed network structure is divided into two parts:attention-based modal transfer module embedded in the input stage of the feature network,which can reduce the difference across modals,and block-based multi-granularity feature decomposition module,which can consider both global information and local information and improve the utilization rate of effective information.The experimental results on the open data set SYSU-MM01 show that,the rank 1 of the cumulative matching characteristic(CMC)index and the mean average precision(mAP)index of the proposed method reaches 56.45%and 53.52%,respectively,which are 6.53%and 2.79%higher than the current best method(XIV,AAAI-2020),and effectively improves the performance of cross-modal pedestrian recognition from visible to infrared light.

关 键 词:行人重识别 跨模态 注意力 多粒度特征 

分 类 号:TP394.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象