融合CNN-SAM与GAT的多标签文本分类模型  被引量:7

Multi-Label Text Classification Model Combining CNN-SAM and GAT

在线阅读下载全文

作  者:杨春霞[1,2,3] 马文文 陈启岗 桂强 YANG Chunxia;MAWenwen;CHEN Qigang;GUI Qiang(School of Automation,Nanjing University of Information Science&Technology,Nanjing 210044,China;Jiangsu Key Laboratory of Big Data Analysis Technology(B-DAT),Nanjing 210044,China;Jiangsu Collaborative Innovation Center of Atmospheric Environment and Equipment Technology(CICAEET),Nanjing 210044,China)

机构地区:[1]南京信息工程大学自动化学院,南京210044 [2]江苏省大数据分析技术重点实验室,南京210044 [3]江苏省大气环境与装备技术协同创新中心,南京210044

出  处:《计算机工程与应用》2023年第5期106-114,共9页Computer Engineering and Applications

基  金:国家自然科学基金(61273229);江苏省青蓝工程资助项目。

摘  要:现有基于神经网络的多标签文本分类研究方法存在两方面不足,一是不能全面提取文本信息特征,二是很少从图结构数据中挖掘全局标签之间的关联性。针对以上两个问题,提出融合卷积神经网络-自注意力机制(CNNSAM)与图注意力网络(GAT)的多标签文本分类模型(CS-GAT)。该模型利用多层卷积神经网络与自注意力机制充分提取文本局部与全局信息并进行融合,得到更为全面的特征向量表示;同时将不同文本标签之间的关联性转变为具有全局信息的边加权图,利用多层图注意力机制自动学习不同标签之间的关联程度,将其与文本上下文语义信息进行交互,获取具有文本语义联系的全局标签信息表示;使用自适应融合策略进一步提取两者特征信息,提高模型的泛化能力。在AAPD、RCV1-V2与EUR-Lex三个公开英文数据集上的实验结果表明,该模型所达到的多标签分类效果明显优于其他主流基线模型。The existing research methods of multi-label text classification based on neural network have two shortcomings:one is that they can not fully extract text information features, and the other is that they rarely mine the association between global labels from graph structure data. To solve the above two problems, this paper proposes a multi-label text classification model(CS-GAT)integrating convolutional neural network self attention mechanism and graph attention network. The model uses multi-layer convolutional neural network and self attention mechanism to fully extract and fuse the local and global information of the text, so as to obtain a more comprehensive feature vector representation. At the same time, the relevance between different text labels is transformed into an edge weighted graph with global information.The multi-layer graph attention mechanism is used to automatically learn the degree of association between different labels, and then interact with the text context semantic information to obtain the global label information representation with text semantic connection. Finally, the adaptive fusion strategy is used to further extract the feature information of the two models to improve the generalization ability of the model. The experimental results on three open English data sets,AAPD, RCV1-V2 and EUR-Lex, show that the multi-label classification effect achieved by this model is significantly better than other mainstream baseline models.

关 键 词:多标签文本分类 多层卷积神经网络 自注意力机制 多头图注意力机制 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象