基于条件随机场与规则相结合的中文地名识别  被引量:2

Recognition of Chinese Location Name based on Combination of Conditional Random Fields with Multi-rules

在线阅读下载全文

作  者:高国洋[1] 戚银城[1] 潘德锋[2] 

机构地区:[1]华北电力大学电子与通信工程系,河北保定071003 [2]华北电力大学计算机系,河北保定071003

出  处:《电脑开发与应用》2009年第8期26-28,共3页Computer Development & Applications

基  金:华北电力大学博士学位教师科研基金资助(200812005)

摘  要:对中文地名识别进行了研究,提出了一种结合多知识的地名识别方法,该方法首先以条件随机场模型为框架,充分利用地名的外部特征和内部颗粒特征,将局部特征、复合特征以及专家知识相融合进行中文地名识别;在此结果上,利用构建的专家规则库对实验结果进行修正。实验结果表明,本文的方法是有效的,实验语料为1998年1月的《人民日报》,开放测试准确率、召回率、和F-值分别达到了93.64%、90.36%、92.03%。Chinese location name recognition is researched in this paper, and a new approach is proposed to recognize Chinese location name, which combing multi-knowledge. Firstly, the approach makes full use of inner features and exterior features of location name, based conditional random fields model, where, combining local features, hybrid features, related features with expert knowledge to recognize Chinese location name. Then through the analysis of experimental results, a simple rule-base is constructed, which is used to optimize the experimental results. The experimental results show that the precision is 93.64%, the recall is 90. 36% and the F-measure is 92.03% in People's Daily (January, 1998), which prove the validity of this approach.

关 键 词:中文地名识别 命名实体识别 条件随机场 信息抽取 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象