基于属性标记的专有名词自动识别研究

Recognition of Chinese Proper Noun Based on Attribute Tag

出　　处：《计算机技术与发展》2006年第11期195-198,共4页Computer Technology and Development

摘　　要：提出了一种新的基于属性标记的专有名词统一识别方法。其基本思想是:根据专有名词的成词特点,利用标注语料库,设定词语属性作为标准属性重新进行标注,在此语料基础上进行专有名词成词结构、成词环境的实例提取,并采用基于转换的错误驱动方法对提取的实例进行适用规则提取。在提取的实例和规则的基础上进行属性标注,是一种基于转换的错误驱动规则自学习方法与基于实例的学习方法相结合的基于浅层句法分析的一种新的识别专有名词的方法。实验证明该方法在测试样本集上准确率达到95.3%,召回率达到92.5%,是一种有效的专有名词识别方法。Introduces a new method to identify the Chinese proper noun. It is based on attribute tag, The basic thinking is ： according the characteristics about the Chinese proper noun compages, using label corpus, enact the words attribute to be the standard attribute and relabeled it. Based on the corpus,distilling the Chinese proper noun instances about compares configuration and compages environnwnt, using the transfomiation - based error- drive learning method to distill the fit regulation. Doing attribute label based on the instance and regulation which just distilled is the method combined the transfonnatkion- based error - drive learning and instance - based learning. Experiments proved this method ratio of nicety aehieved 95.3 % on testing stylebooks, the ratio of recall achied, 92.5 %,so it is an effcetive method to identify Chinese proper noun.

关键词：中文专有名词识别未登录词识别属性标注基于转换的错误驱动学习方法

分类号：TP391.1[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于属性标记的专有名词自动识别研究

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于属性标记的专有名词自动识别研究

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索