计算机彝文自动分词技术的设计研究被引量：4

Design and Implementation of Yi Automatic Segmentation Technique in Computer

作　　者：王成平[1]

机构地区：[1]西南民族大学民族语言文字信息处理实验中心,四川成都610041

出　　处：《湘潭大学自然科学学报》2012年第3期107-113,共7页Natural Science Journal of Xiangtan University

基　　金：中央高校基本科研业务费专项资金项目09SZYZJ04);国家社科基金项目(06XYY021;07BYY060);归国留学人员创新基金资助项目(09SLX03)

摘　　要：实现彝语文自动分词是计算机彝文信息处理中一项不可缺少的基础性工作,计算机彝文信息处理只要涉及到信息检索、机器翻译、语法分析、语义分析等方面的应用,就都需要以词为基本的处理单位.论文以彝语言的特点作为出发点,首先提出了计算机彝文分词规则与分词词表的设计思路,其次提出了实现计算机彝文自动分词技术的算法基础、系统结构,以及实现流程,而且进行了抽样测试,其分词的速度和准确率都比较高.论文最后根据彝语言的特点对实现计算机彝文自动分词的难点进行了分析.The automatic word segmentation is an indispensable basic work of Yi language information processing. As long as Yi language information processing related to the retrieval, translation, syntactic a- nalysis , semantic analysis,it requires the use of word as basic unit. On this basis according to characteris- tics of Yi language,the automatic word segmentation standard and design of word vocabulary are described. The technology of automatic word segmentation is proposed, which based on established vocabulary of Yi language. The technology includes algorithm selection, system architecture,and implementation process. And sample tests are given, the accuracy rate and speed of word segmentation are quite satisfactory. Finally, on characteristics of Yi language and the difficulty of achieve automatic word segmentation is analyzed.

关键词：彝语文自动分词算法测试评价难点分析

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

计算机彝文自动分词技术的设计研究被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

计算机彝文自动分词技术的设计研究 被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

计算机彝文自动分词技术的设计研究被引量：4