A Word Structure Analysis by Extending the Word Tag Set (利用扩展标记集的词结构分析)

Cited by: 2

Source: Journal of Chinese Information Processing (《中文信息学报》), 2014, No. 5, pp. 39-45, 82 (8 pages in total)

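Funding: National Natural Science Foundation of China, Young Scientists Fund (61202162); Doctoral Program Foundation of the Ministry of Education, New Teacher category (20123201120011); National 863 Program, Frontier Technology Research project (2012AA011102)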

Abstract: This paper proposes an alternative to conventional word segmentation for lexical analysis: analyzing the internal structure of words by extending the word tag set. We first describe the characteristics of the internal structure of words and treat the prefixes and suffixes within that structure as special words, so that identifying the prefixes and suffixes of each word identifies its internal structure. The structure identification problem is thereby converted into a sequence tagging problem, which is solved with a CRF model over the extended tag set. Experiments show that the method achieves relatively high accuracy, both in overall performance and in identifying each layer of the structure.

Keywords: extended tag set; word structure analysis; prefixes and suffixes; sequence tagging problem

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]
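
The abstract describes recasting word-structure identification as sequence tagging over an extended tag set. The sketch below illustrates that general idea only. The tag scheme (BMES position tags combined with the role labels `PRE`, `ROOT`, `SUF`), the function names, and the example words are assumptions made for illustration; the abstract does not specify the paper's actual tag set or features. A CRF would then be trained on character sequences labeled in this way, and that step is not shown here.

```python
from typing import List, Tuple

# Hypothetical role labels for the parts of a word (not taken from the paper).
PRE, ROOT, SUF = "PRE", "ROOT", "SUF"


def position_tags(n: int) -> List[str]:
    """Return BMES position tags for a unit of n characters."""
    if n <= 0:
        return []
    if n == 1:
        return ["S"]  # single-character unit
    return ["B"] + ["M"] * (n - 2) + ["E"]


def tag_word(parts: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Label each character with its position in its part plus the part's role."""
    tagged = []
    for text, role in parts:
        for ch, pos in zip(text, position_tags(len(text))):
            tagged.append((ch, f"{pos}-{role}"))
    return tagged


def decode(tagged: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Recover (part, role) pairs from a tagged character sequence."""
    parts, buf = [], ""
    for ch, tag in tagged:
        pos, role = tag.split("-")
        buf += ch
        if pos in ("E", "S"):  # a part ends on E (end) or S (single)
            parts.append((buf, role))
            buf = ""
    return parts


# Example: a one-character prefix followed by a two-character root.
tags = tag_word([("副", PRE), ("总统", ROOT)])
print(tags)          # [('副', 'S-PRE'), ('总', 'B-ROOT'), ('统', 'E-ROOT')]
print(decode(tags))  # [('副', 'PRE'), ('总统', 'ROOT')]
```

Because the role is folded into each character's tag, a single tagging pass yields both the boundaries and the type of each part, which is why extending the tag set is enough to expose word-internal structure to a standard sequence labeler.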
