基于序列模型的作战文书知识抽取技术研究  被引量:2

Research on Knowledge Extraction Technology of Combat Document Based on Sequence Model

在线阅读下载全文

作  者:王乾铭 程健庆[1] 李吟[1] WANG Qianming;CHENG Jianqing;LI Yin(Jiangsu Automation Research Institute of CSIC,Lianyungang 222006)

机构地区:[1]江苏自动化研究所,连云港222006

出  处:《舰船电子工程》2020年第4期16-20,43,共6页Ship Electronic Engineering

基  金:装发共性基础课题“基于机器学习的军事知识生成、演化与评估技术”(编号:31511120201)资助。

摘  要:作战文书具有实体名称复杂多样但结构规范的特点,并且在句子中有大量的重叠实体关系。对于作战文书的知识抽取,现有的方法中采用的流水线模型有误差传播以及关系冗余的问题造成关系抽取能力较差,并且现有的流水线模型无法抽取作战文书中复杂的重叠实体关系。针对这些问题,文中提出了一种基于序列生成模型并结合位置注意力机制的实体与关系联合抽取模型。通过使用作战文书作为数据集并与其他知识抽取模型做对比实验,论文模型既提高了识别非重叠实体关系的准确率,又实现了对重叠实体关系的抽取,从而提高了作战文书知识抽取的整体效果。Combat documents have the characteristics of complex and diverse entity names but standardized structure,and there are a large number of overlapping entity relationships in the sentence.For the knowledge extraction of combat documents,the pipeline model adopted in the existing method has the problem of error propagation and relationship redundancy,which results in poor relation extraction capabilities,and the existing pipeline model cannot extract complex overlapping entity relationships in com⁃bat documents.In response to these problems,the paper proposes a joint generation model of entities and relationships based on a sequence generation model combined with a location attention mechanism to achieve overlapping entities relation extraction.By us⁃ing combat documents as a data set and other knowledge extraction models for comparison experiments,this paper improves the ac⁃curacy of identifying overlapping entity relationships.

关 键 词:作战文书 知识抽取 重叠实体关系 序列模型 位置注意力机制 

分 类 号:TP311.10[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象