基于多模态图卷积神经网络的行人重识别方法  被引量:1

Person re-identification method based on multi-modal graph convolutional neural network

在线阅读下载全文

作  者:何嘉明 杨巨成[1] 吴超[1] 闫潇宁 许能华 HE Jiaming;YANG Jucheng;WU Chao;YAN Xiaoning;XU Nenghua(College of Artificial Intelligence,Tianjin University of Science and Technology,Tianjin 300457,China;Shenzhen Softsz Technology Company Limited,Shenzhen Guangdong 518131,China)

机构地区:[1]天津科技大学人工智能学院,天津300457 [2]深圳市安软科技股份有限公司,广东深圳518131

出  处:《计算机应用》2023年第7期2182-2189,共8页journal of Computer Applications

摘  要:针对行人重识别中行人文本属性信息未被充分利用以及文本属性之间语义联系未被挖掘的问题,提出一种基于多模态的图卷积神经网络(GCN)行人重识别方法。首先使用深度卷积神经网络(DCNN)学习行人文本属性与行人图像特征;然后借助GCN有效的关系挖掘能力,将文本属性特征与图像特征作为GCN的输入,通过图卷积运算来传递文本属性节点间的语义信息,从而学习文本属性间隐含的语义联系信息,并将该语义信息融入图像特征中;最后GCN输出鲁棒的行人特征。该多模态的行人重识别方法在Market-1501数据集上获得了87.6%的平均精度均值(mAP)和95.1%的Rank-1准确度;在DukeMTMC-reID数据集上获得了77.3%的mAP和88.4%的Rank-1准确度,验证了所提方法的有效性。Aiming at the problems that person textual attribute information is not fully utilized and the semantic relationships among the textual attributes are not mined in person re-identification,a person re-identification method based on multi-modal Graph Convolutional neural Network(GCN)was proposed.Firstly,Deep Convolutional Neural Network(DCNN)was used to learn person textual attributes and person image features.Then,with the help of the effective relationship mining ability of GCN,the textual attribute features and image features were treated as the input of GCN,and the semantic information of the textual attribute nodes was transferred through the graph convolution operation,so as to learn the implicit semantic relationship information among the textual attributes and incorporate this semantic information into image features.Finally,the robust person features were output by GCN.The multi-modal person re-identification method achieves the mean Average Precision(mAP)of 87.6% and the Rank-1 accuracy of 95.1% on Market-1501 dataset,and achieves the mAP of 77.3% and the Rank-1 accuracy of 88.4%on DukeMTMC-reID dataset,which verify the effectiveness of the proposed method.

关 键 词:行人重识别 多模态 图卷积神经网络 行人文本属性 隐含语义联系 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象