自然语言词性序列的分类  

The Classification of Lexical Category Sequence of Natural Language

在线阅读下载全文

作  者:徐芃[1] 熊健[2] 

机构地区:[1]广州大学心理咨询中心,广州510006 [2]广州大学经济与统计学院,广州510006

出  处:《华南师范大学学报(自然科学版)》2014年第4期110-115,共6页Journal of South China Normal University(Natural Science Edition)

基  金:教育部人文社会科学研究规划基金项目(11YJAZH106)

摘  要:采集142份主题句作业自然语言语料数据,利用中文自然语言处理平台构造自然语言的词性序列;经过语言结构粗粒化处理,建构由名词、动词、形容词和代词等4种实词构成的词性序列分类模型.研究结果显示,基于词性含量的自然语言词性序列分类模型的准确率达到90%;基于词序位置的自然语言词性序列的分类模型的准确率达到了95%.自然语言的词性序列分类模型在语言认知领域具有较好的应用价值,不仅可以揭示和证实语言与心理信息之间存在的相关关系,而且可以通过客观的语言符号对内隐的心理信息做出科学的评估.After collecting 142 sample datasets from a topic sentence natural language corpus, lexical category sequencing of natural language is constructed by a Chinese natural language processing platform. The lexical category sequence classification model is composed of noun, verb, adjective and pronoun after the coarse graining processing of language structure. The results show that the lexical category sequence classification model based on the lexical category content do archive the accuracy of 90% , and the same model based on the position of word order do archive the accuracy of 95%. Thus, it proves that the lexical category sequence of natural model would be valuable in linguistic cognitive. It not only reveals the relationship between ogy information, but also assesses the implicit psychology information scientifically notations. through language classification language and psychology information, but also assesses the implicit psychology information scientifically through the objective linguistic notations.

关 键 词:自然语言 心理信息 词性 词序 分类 

分 类 号:O212.1[理学—概率论与数理统计]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象