检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]华北电力大学电子与通信工程系,河北保定071003 [2]华北电力大学计算机系,河北保定071003
出 处:《电脑开发与应用》2009年第8期26-28,共3页Computer Development & Applications
基 金:华北电力大学博士学位教师科研基金资助(200812005)
摘 要:对中文地名识别进行了研究,提出了一种结合多知识的地名识别方法,该方法首先以条件随机场模型为框架,充分利用地名的外部特征和内部颗粒特征,将局部特征、复合特征以及专家知识相融合进行中文地名识别;在此结果上,利用构建的专家规则库对实验结果进行修正。实验结果表明,本文的方法是有效的,实验语料为1998年1月的《人民日报》,开放测试准确率、召回率、和F-值分别达到了93.64%、90.36%、92.03%。Chinese location name recognition is researched in this paper, and a new approach is proposed to recognize Chinese location name, which combing multi-knowledge. Firstly, the approach makes full use of inner features and exterior features of location name, based conditional random fields model, where, combining local features, hybrid features, related features with expert knowledge to recognize Chinese location name. Then through the analysis of experimental results, a simple rule-base is constructed, which is used to optimize the experimental results. The experimental results show that the precision is 93.64%, the recall is 90. 36% and the F-measure is 92.03% in People's Daily (January, 1998), which prove the validity of this approach.
关 键 词:中文地名识别 命名实体识别 条件随机场 信息抽取
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.70