Nested Named Entity Recognition Combining Multi-modal and Multi-span Features

Authors: QIU Yunfei [1], XING Haoran, YU Zhilong, ZHANG Wenwen

Affiliations: [1] School of Software, Liaoning Technical University, Huludao, Liaoning 123105, China; [2] School of Business Administration, Liaoning Technical University, Huludao, Liaoning 123105, China

Source: Journal of Frontiers of Computer Science and Technology, 2024, No. 6, pp. 1613-1626 (14 pages)

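Funding: National Natural Science Foundation of China (62173171); Natural Science Foundation of Liaoning Province (2015020095); Scientific and Technological Research Project of the Education Department of Liaoning Province (LJYL051); Fuxin Mineral Resources Compilation Project (1920411).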

Abstract: Nested named entity recognition (NNER) has become a research hotspot in information extraction because of its growing practical importance. However, scarce corpus resources, limited exhaustive-enumeration windows, and missing span features have slowed NNER research in vertical domains and lead to misrecognized or omitted entities. To address these problems, this paper proposes a vertical-domain NNER model that takes mineralogy as its research setting and incorporates a corpus-aware dictionary. First, pointwise mutual information (PMI), term frequency-inverse document frequency (TF-IDF), and an attention mechanism are combined to build the corpus-aware dictionary automatically, and anchor-text knowledge is expanded to improve the model's training accuracy. Second, from a shared perspective, three fusion strategies for multi-modal information are designed: the encoder is trained to learn extended vector representations of characters, glyphs, and words, and a triple product operation together with a slice attention mechanism filters and integrates the private representations captured by a multi-layer perceptron, narrowing the spatial gap between heterogeneous features. Third, a bottom-up hierarchical architecture determines the contextual associations between spans and generates a set of proposal spans; a biaffine mechanism and a linear classifier then extract features such as the relation between the target span and its adjacent spans, the internal representation of the target span, and the target span's boundaries. Finally, each target span is assigned its entity type label. Experimental results on six datasets show that, compared with the baseline models, the proposed method achieves significant performance gains and effectively improves NNER in low-resource scenarios.
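
The dictionary-building step in the abstract relies on pointwise mutual information (PMI) to judge whether adjacent characters tend to form a term. The abstract does not give the authors' exact formulation, so what follows is only a minimal sketch of the standard definition, PMI(a, b) = log(p(a, b) / (p(a) p(b))), applied to adjacent character pairs. The function name `pmi_scores` and the toy corpus are hypothetical, and the TF-IDF and attention components the abstract mentions are not modeled here.

```python
import math
from collections import Counter


def pmi_scores(sentences):
    """Return the PMI of every adjacent character pair seen in `sentences`.

    Hypothetical helper for illustration; not the authors' implementation.
    """
    unigrams = Counter()
    bigrams = Counter()
    for s in sentences:
        unigrams.update(s)              # count single characters
        bigrams.update(zip(s, s[1:]))   # count adjacent character pairs

    n_uni = sum(unigrams.values())
    n_bi = sum(bigrams.values())

    scores = {}
    for (a, b), count in bigrams.items():
        p_ab = count / n_bi
        p_a = unigrams[a] / n_uni
        p_b = unigrams[b] / n_uni
        scores[a + b] = math.log(p_ab / (p_a * p_b))
    return scores


# Hypothetical toy corpus of mineralogy-flavoured strings (an assumption,
# not data from the paper).
corpus = ["石英砂岩", "石英脉", "含石英", "砂岩层"]
scores = pmi_scores(corpus)

# Pairs with higher PMI are stronger candidates for dictionary entries.
candidates = sorted(scores, key=scores.get, reverse=True)
```

In this toy corpus the pair "石英" (quartz) scores higher than the accidental pair "英砂", which is the kind of signal a PMI-based filter would use to keep the former as a dictionary candidate.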

Keywords: nested named entity recognition; multi-modal; multi-task; distant supervision; mineralogy

Classification number: TP391.1 [Automation and Computer Technology: Computer Application Technology]

 
