开放式信息抽取研究进展  被引量:28

Progress in Open Information Extraction

在线阅读下载全文

作  者:杨博[1] 蔡东风[1] 杨华[2] 

机构地区:[1]沈阳航空航天大学知识工程研究中心,辽宁沈阳110136 [2]沈阳航空航天大学计算机学院,辽宁沈阳110136

出  处:《中文信息学报》2014年第4期1-11,36,共12页Journal of Chinese Information Processing

基  金:国家"十二五"科技支撑计划(2012BAH14F00);国家自然科学基金(61073123)

摘  要:从大规模非结构化文本中自动地抽取有用信息是自然语言处理和人工智能的一个重要目标。开放式信息抽取在高效挖掘网络文本信息方面已成为必然趋势,按关系参数可分为二元、多元实体关系抽取,该文按此路线对典型方法的现状和存在问题进行分析与总结。目前多数开放式实体关系抽取仍是浅层语义处理,对隐含关系抽取很少涉及。采用马尔科夫逻辑、本体结构推理等联合推理方法可综合多种特征,有效推断细微完整信息,为深入理解文本打开新局面。Extracting useful information automatically from large-scale unstructured texts has been a long-standing goal of NLP and AI. And open information extraction is now widely pursued for effective web information acquisition. Open information extraction can be divided into dual and n-tuple entity relation extraction according to the number of arguments involved. In accordance with these two aspects, this paper analyses several typical methods for open relation extraction together with their defects. It is indicated that most current methods still belong to shallow semantic processing, hardly considering the implicit relation. Therefore, it is beleved that the adoption of joint inference strategy such as the markov logic and the ontology structure based inference can take advantage of multiple features. The combination of open and open up a promising prospect to infer the fine and full information for open information extraction.

关 键 词:开放式信息抽取 联合推理 文本理解 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象