Document-level relation extraction of a graph reasoning embedded dynamic self-attention network

Authors: LI Yunjie; WANG Danyang; LIU Haitao[1,3]; WANG Huadong; WANG Peizhuang

Affiliations: [1] Institute of Mathematics and Systems Science, Liaoning Technical University, Fuxin 123000, China; [2] Institute of Scientific and Technical Information, Chinese Academy of Tropical Agricultural Sciences, Haikou 571000, China; [3] Institute of Intelligence Engineering and Mathematics, Liaoning Technical University, Fuxin 123000, China; [4] Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China

Source: CAAI Transactions on Intelligent Systems (《智能系统学报》), 2025, No. 1, pp. 52-63 (12 pages)

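Funding: National Natural Science Foundation of China (61350003); Key Project of the Basic Scientific Research Program for Higher Education Institutions, Liaoning Provincial Department of Education (LJKZZ-20220047); Central Public-interest Scientific Institution Basal Research Fund (1630072023005).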

Abstract: Document-level relation extraction aims to extract all entity pairs that hold a semantic relation in a document and to classify the relation type of each pair. Unlike sentence-level relation extraction, determining a relation here requires reasoning over multiple sentences in the document. Existing methods mainly rely on self-attention, which raises two technical challenges at the document level: the high computational complexity of semantically encoding long text, and the complex reasoning that relation prediction requires. To address both, this paper proposes a graph reasoning embedded dynamic self-attention network (GSAN). GSAN uses a gated word selection mechanism to dynamically select important words over which self-attention is computed, which models semantic dependencies in long text efficiently. It also builds a document graph in which the selected words serve as global semantic context alongside entity candidate nodes and document nodes, and it embeds the information aggregated by graph reasoning on this document graph into the dynamic self-attention module, giving the model the capacity for complex reasoning. Experiments on the public document-level relation extraction datasets CDR and DocRED show that the proposed model improves significantly over the baseline models.
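The abstract does not give the model's equations, so the following is only an illustrative sketch of the general idea of gated word selection followed by attention over the selected words. It is not the authors' implementation. The sigmoid gate, the top-k selection rule, the scaled dot-product scoring, and every name in the code (`select_words`, `dynamic_self_attention`, `gate_weights`, `k`) are assumptions made for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def softmax(scores):
    # Subtract the maximum for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def select_words(tokens, gate_weights, k):
    """Score each token with a sigmoid gate and keep the k highest-scoring positions.

    Hypothetical selection rule: the paper's actual gating may differ.
    """
    gates = [sigmoid(dot(t, gate_weights)) for t in tokens]
    top = sorted(range(len(tokens)), key=lambda i: gates[i], reverse=True)[:k]
    return sorted(top), gates


def dynamic_self_attention(tokens, gate_weights, k):
    """Let every token attend only to the k selected tokens.

    This computes n * k attention scores instead of the n * n scores
    that full self-attention would need.
    """
    selected, _ = select_words(tokens, gate_weights, k)
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    outputs = []
    for query in tokens:
        weights = softmax([dot(query, tokens[j]) / scale for j in selected])
        # Context vector: attention-weighted sum of the selected token vectors.
        context = [
            sum(w * tokens[j][d] for w, j in zip(weights, selected))
            for d in range(dim)
        ]
        outputs.append(context)
    return outputs, selected


# Toy input: 8 random token vectors of dimension 4, keeping 3 selected words.
random.seed(0)
n_tokens, dim, k = 8, 4, 3
tokens = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(n_tokens)]
gate_weights = [random.gauss(0, 1) for _ in range(dim)]

outputs, selected = dynamic_self_attention(tokens, gate_weights, k)
print("selected positions:", selected)
print("output size:", len(outputs), "x", len(outputs[0]))
```

In this sketch the cost of attention grows with the number of selected words rather than with the full document length, which is the efficiency argument the abstract makes. The document graph and graph reasoning components are not covered here, since the abstract does not specify how they are built.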

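Keywords: document-level relation extraction; graph reasoning; dynamic self-attention network; self-attention mechanism; gated word selection mechanism; document graph; graph attention network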

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

 
