检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]北京邮电大学智能科学与技术中心,北京100876
出 处:《北京邮电大学学报》2014年第6期120-124,共5页Journal of Beijing University of Posts and Telecommunications
基 金:国家自然科学基金项目(61273365)
摘 要:面向英语文章的词性标注是对英语文章实现自动批改的基础,虽然研究者对英语词性标注做了大量有益的研究,但是大多数的研究都面向英语为第一语言的用户,而面向英语为第二语言用户的相关研究则很少.为此,对以英语为第二语言用户的英语文章进行了人工标注,在此基础上提出了一种面向英语文章的词性标注算法,融合了词聚类、无标语料统计信息、单词发音等特征.实验结果表明,该算法能有效提高词性标注性能,标注正确率从94.49%可提高到97.07%.Part-of-speech tagging for Chinese English learner language is the base of automated essay scoring system. Much of fruitful part-of-speech tagging researches researchers was done,however,most of them are focused on the English essays written by native speaker,there is no research about essays of Chinese English learner. A corpus of Chinese English learner essay are annotated,and a part-of-speech tagging algorithm for Chinese English learner language is presented. This algorithm uses rich features,such as unsupervised word clusters,unsupervised tag dictionary and phonetic normalization. Based on these rich features,the system outperforms the state-of-art tagging on the corpus,and the tagging accuracy is raised from 94. 49% to 97. 07%.
分 类 号:TN911.22[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249