Multi-label text classification based on graph embedding and region attention  Cited by: 15


Authors: WANG Jin[1], XU Wei[1], DING Yi, SUN Kaiwei[1], WANG Lilei

Affiliations: [1] Key Laboratory of Data Engineering and Visual Computing, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; [2] School of Electronic Science and Engineering, Nanjing University, Nanjing, Jiangsu 210023, China

Source: Journal of Jiangsu University: Natural Science Edition, 2022, No. 3, pp. 310-318 (9 pages)

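Funding: Young Scientists Fund of the National Natural Science Foundation of China (61806033); General Program of the Natural Science Foundation of Chongqing (cstc2019jcyj-msxmX0021).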

Abstract: Traditional multi-label text classification models ignore the correlations among labels and between labels and the individual parts of a text, and they predict low-frequency labels poorly. To address these problems, graph embedding and a region attention mechanism are used to mine the relationships among labels and between labels and text, and an encoder-graph embedding and region attention-decoder model is proposed for multi-label classification. Bi-LSTM serves as the encoder, and the label embedding matrix is generated by graph embedding. The region attention mechanism combines token-level and region-level information so that the model considers different parts of the text when predicting each label, which mines the latent association between text and labels. A recurrent neural network (RNN) and a multilayer perceptron (MLP) serve as the decoder and are combined with a stochastic policy gradient algorithm to reduce training loss and improve multi-label classification. Experiments were carried out on the AAPD and RCV1-V2 multi-label text classification datasets, with the relevant parameters set according to the characteristics of each dataset. Using micro-F1 and Hamming Loss as evaluation metrics, the proposed model was compared with 9 classic models, including LP and convolutional neural networks (CNN). The results show that the proposed model can predict low-frequency labels from high-frequency ones, and that it achieves higher micro-F1 and lower Hamming Loss than the classic models on both datasets.
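The two evaluation metrics named in the abstract, micro-F1 and Hamming Loss, can be illustrated with a minimal sketch. It assumes labels are encoded as binary indicator rows (one row per document, one column per label). The function names `micro_f1` and `hamming_loss` and the toy data are illustrative assumptions, not the authors' evaluation code.

```python
def micro_f1(y_true, y_pred):
    """Micro-averaged F1: pool TP/FP/FN over all documents and labels."""
    tp = fp = fn = 0
    for true_row, pred_row in zip(y_true, y_pred):
        for t, p in zip(true_row, pred_row):
            if t == 1 and p == 1:
                tp += 1
            elif t == 0 and p == 1:
                fp += 1
            elif t == 1 and p == 0:
                fn += 1
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def hamming_loss(y_true, y_pred):
    """Fraction of (document, label) slots that are predicted incorrectly."""
    total = sum(len(row) for row in y_true)
    wrong = sum(
        t != p
        for true_row, pred_row in zip(y_true, y_pred)
        for t, p in zip(true_row, pred_row)
    )
    return wrong / total if total else 0.0


# Hypothetical toy data: 3 documents, 4 labels (not from AAPD or RCV1-V2).
y_true = [[1, 0, 1, 0], [0, 1, 0, 0], [1, 1, 0, 1]]
y_pred = [[1, 0, 0, 0], [0, 1, 1, 0], [1, 1, 0, 1]]

print(round(micro_f1(y_true, y_pred), 4))      # 0.8333
print(round(hamming_loss(y_true, y_pred), 4))  # 0.1667
```

Higher micro-F1 and lower Hamming Loss both indicate better predictions, which is the direction of improvement the abstract reports.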

Keywords: multi-label; text classification; sequence-to-sequence model; graph embedding; region attention; recurrent neural network

CLC number: TP391.9 [Automation and Computer Technology: Computer Application Technology]

 
