结合语义与图像信息的行人属性识别算法  被引量:1

Pedestrian Attribute Recognition Algorithm Combining Semantic and Image Information

在线阅读下载全文

作  者:杨祖赫 黎智辉[2] 唐云祁[1] 晏于文 宋华青 YANG Zuhe;LI Zhihui;TANG Yunqi;YAN Yuwen;SONG Huaqing(School of Investigation,People's Public Security University of China,Beijing 100038,China;Institute of Forensic Science of China,Beijing 100038,China)

机构地区:[1]中国人民公安大学侦查学院,北京100038 [2]公安部物证鉴定中心,北京100038

出  处:《计算机工程》2023年第8期215-222,231,共9页Computer Engineering

基  金:国家重点研发计划(2021YFF0602102);公安部技术研究计划(2019JSYJA06);公安部物证鉴定中心基本科研专项(2022JB024)。

摘  要:为提升行人属性的识别精度,充分利用行人属性间自然语义关联并解决不同属性相关图像信息的提取差问题,提出结合语义与图像信息的行人属性识别算法。通过自注意力机制的关系建模能力挖掘行人属性间的内在联系,利用交叉注意力机制建立属性间语义信息与图像特征信息的关系。在此基础上,依靠卷积融合图像的高阶与低阶特征并为模块增加局部特征信息,提升模型的泛化能力,通过设计属性预测模块,使模型可与任意骨干网络相拼接,进一步提升识别性能。实验结果显示,该算法的平均精度、准确率、F1值在PA-100K和PETA数据集上分别为84.04%、79.71%、88.03%和89.04%、82.39%、89.06%,与ALM、JLAC等算法相比,能够充分利用属性语义与图像特征信息,在多项评价指标上有明显提升。To improve the recognition precision of pedestrian attributes and solve the problems of lack of use of natural semantic associations between pedestrian attributes and poor extraction of image information related to different attributes,this study proposes a pedestrian attribute recognition algorithm that combines semantic and image information.First,the relationship modeling ability of self-attention mechanism is utilized to explore the intrinsic relationship between pedestrian attributes,and cross-attention is utilized to establish the relationship between the semantic information between attributes and image feature information.Second,based on convolutional fusing high and low-order features,and adding local feature information into the module,the generalization ability of the model is improved.Owing to the design of the attribute prediction module,the model can be spliced with any backbone network and exhibits good performance.The experimental results show that the mean precision,accuracy,and F1 value of the proposed algorithm on the PA-100K and PETA datasets are 84.04%,79.71%,88.03%,and 89.04%,82.39%,89.06%,respectively.Compared with existing algorithms such as ALM and JLAC,this algorithm can exploit attribute semantics and image feature information and has a significant improvement in multiple evaluation indicators.

关 键 词:行人属性识别 自注意力 卷积 特征融合 多标签分类 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象