藏语词语兼类情况及识别规则库  

Multi-Category Words of Tibetan and the Recognition Rulebase

在线阅读下载全文

作  者:完么扎西[1] 

机构地区:[1]青海师范大学民族师范学院,青海海南813000

出  处:《西藏大学学报(社会科学版)》2014年第5期87-94,共8页Journal of Tibet University

摘  要:同其他语言一样藏语词性的兼类现象普遍存在,这给词性标注工作带来了巨大困难,对兼类词的处理是藏语词性标注的关键所在。文章利用传统和现代藏语语法理论,在分析藏语真实文本的基础上,归纳了藏语兼类词的种类,提出了兼类词的标注原则。并根据词语搭配关系和词的组合结构构建了兼类词的识别规则库,利用该规则库可对兼类词的词性进行较准确的标注。The multi-category phenomenon of speech are ubiquitous in Tibetan language as in other languagesand it has brought great difficulties in the speech tagging work.Therefore,the processing multi-category words isone of the key problem in Tibetan speech tagging.In the present paper,the types of multi-category words in Ti-betan language were summarized and the tagging principle of multi-category words was proposed based on ana-lyzing the true text of Tibetan with the traditional and modern Tibetan grammar theory.According to the colloca-tion relations of expressions and combined structure of Tibetan words,a recognition rulebase of multi-categorywords was developed and that can tag the speech of the multi-category words accurately.

关 键 词:藏文信息处理 兼类词 标注原则 识别规则库 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象