基于随机森林的N1+N2结构语法关系判定方法研究  被引量:5

Research on Judging Method of N1+N2 Structure Grammatical Relation Based on Random Forest

在线阅读下载全文

作  者:杨泉[1] YANG Quan(College of Chinese Language and Culture,Beijing Normal University,Beijing 100875,China)

机构地区:[1]北京师范大学汉语文化学院,北京100875

出  处:《重庆理工大学学报(自然科学)》2021年第7期125-130,共6页Journal of Chongqing University of Technology:Natural Science

基  金:国家语委科研项目(YB135-91)。

摘  要:提出了一种基于随机森林的N1+N2结构语法关系分类判定方法,在自建熟语料库的基础上,为每个短语结构建立用于分类决策树的7个特征,使用C4.5方法生成决策树,构造随机森林算法,通过投票原则给出最终判断结果。经训练集学习后,在含有1 020条语料的测试集中进行测试,正确率达到94.8%。结果表明:使用随机森林算法进行汉语短语结构语法关系分类判定是行之有效的。Judging the grammatical relation of phrase structure is a bottleneck problem in natural language processing,which can be attributed to the classification problem in machine learning.Based on the self-built corpus,a classification and judgment method of N1+N2 structural grammatical relations based on random forest is proposed.Five features are randomly selected from the feature set as the judgment criteria,and 21 decision trees are used as the final judgment result.After learning the training set,the test is carried out in a test set containing 1020 corpus,and the final test result accuracy reaches 94.8%.The results show that it is effective to use random forest algorithm to classify and judge the grammatical relations of Chinese phrase structures.

关 键 词:随机森林 决策树 短语层级 语法关系 词义相似度 人工智能 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象