开放信息抽取研究综述被引量：1

Survey of Open Information Extraction Research

作　　者：胡杭乐程春雷[1,2] 叶青彭琳[1] 沈友志[1] HU Hangle;CHENG Chunlei;YE Qing;PENG Lin;SHEN Youzhi(School of Computer Science,Jiangxi University of Chinese Medicine,Nanchang 330004,China;Key Laboratory of Artificial Intelligence in Chinese Medicine,Jiangxi University of Chinese Medicine,Nanchang 330004,China)

机构地区：[1]江西中医药大学计算机学院,南昌330004 [2]江西中医药大学中医人工智能重点研究室,南昌330004

出　　处：《计算机工程与应用》2023年第16期31-49,共19页Computer Engineering and Applications

基　　金：国家自然科学基金(82260988);江西省自然科学基金(20224BAB206102);江西省教育厅科学技术研究项目(GJJ2200923);江西省卫生和计划生育委员会科技计划项目(202211404)。

摘　　要：开放信息抽取(open information extraction,OpenIE)旨在从自然语言文本中以关系短语及参数的形式生成信息的结构化表示,为知识库自动化构建、开放域问答和显式推理等下游任务提供基础支持。近年来,该领域的研究与应用不断深入,涌现了众多卓有成效的OpenIE研究思路和拓展模型。从OpenIE的定义、数据集和基准度量出发,详细深入地综述和比较了传统的OpenIE模型和基于神经网络的模型。针对传统方法,分类介绍了基于学习的模型和基于规则的模型,并深入研究了不同模型的评估方法,分析了不同类别模型之间的差异。针对基于神经网络的模型,根据抽取谓词的不同方式,将其分为联合抽取和分步抽取两种类型,并对每种模型进行了综述和对比分析。对OpenIE常用的数据集以及主要的评估基准进行了概述,并在此基础上进行了对比分析。从训练、改进以及应用三个角度对OpenIE的工作进行了总结,并对该工作的未来进行了展望。Open information extraction(OpenIE)aims to generate a structured representation of information from natural language text in the form of relational phrases and parameters,providing basic support for downstream tasks such as knowledge base automatic construction,open domain question answering,and explicit reasoning.In recent years,with the deepening of research in this field,researchers have expanded OpenIE from multiple directions and proposed many OpenIE models based on neural networks.Starting from the definition,dataset and benchmark measurement of OpenIE,this paper summarizes and compares the traditional OpenIE model and the model based on neural network in detail.First of all,according to the traditional methods,the learning-based model and rule-based model are introduced,the evaluation methods of different models are deeply studied,and the differences between different types of models are analyzed.Secondly,according to the different ways of extracting predicates,the models based on neural networks are divided into two types:joint extraction and step extraction,and each model is reviewed and compared.Then,the datasets commonly used by OpenIE and the main evaluation benchmarks are summarized,and a comparative analysis is made on this basis.Finally,the work of OpenIE is summarized from three aspects of training,improvement and application,and the future of this work is prospected.

关键词：自然语言处理开放信息抽取(OpenIE) 神经网络

分类号：TP391.1[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

开放信息抽取研究综述被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

开放信息抽取研究综述 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

开放信息抽取研究综述被引量：1