Authors: HU Hangle (胡杭乐); CHENG Chunlei (程春雷)[1,2]; YE Qing (叶青); PENG Lin (彭琳)[1]; SHEN Youzhi (沈友志)[1]
Affiliations: [1] School of Computer Science, Jiangxi University of Chinese Medicine, Nanchang 330004, China; [2] Key Laboratory of Artificial Intelligence in Chinese Medicine, Jiangxi University of Chinese Medicine, Nanchang 330004, China
Source: Computer Engineering and Applications (《计算机工程与应用》), 2023, No. 16, pp. 31-49 (19 pages)
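Funding: National Natural Science Foundation of China (82260988); Natural Science Foundation of Jiangxi Province (20224BAB206102); Science and Technology Research Project of the Jiangxi Provincial Department of Education (GJJ2200923); Science and Technology Program of the Health and Family Planning Commission of Jiangxi Province (202211404).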
Abstract: Open information extraction (OpenIE) aims to generate structured representations of information from natural language text in the form of relational phrases and their arguments, providing basic support for downstream tasks such as automated knowledge base construction, open-domain question answering, and explicit reasoning. In recent years, research and applications in this field have deepened, and many effective OpenIE approaches and extended models have emerged. Starting from the definition, datasets, and benchmark metrics of OpenIE, this paper surveys and compares traditional OpenIE models and neural-network-based models in detail. For traditional methods, it introduces learning-based and rule-based models by category, examines how the different models are evaluated, and analyzes the differences between model categories. For neural-network-based models, it divides them into two types, joint extraction and step-wise extraction, according to how predicates are extracted, and reviews and compares the models of each type. It then outlines the datasets commonly used for OpenIE and the main evaluation benchmarks, with a comparative analysis on that basis. Finally, it summarizes OpenIE work from three perspectives (training, improvement, and application) and discusses future directions for the field.
Keywords: natural language processing; open information extraction (OpenIE); neural network
Classification code: TP391.1 [Automation and Computer Technology / Computer Application Technology]