Automatic Classification of Library Collections Based on Random Forest  (Cited by: 3)


Author: WANG Qing [1] (Shandong Jianzhu University, Jinan 250101, China)

Affiliation: [1] Shandong Jianzhu University, Jinan, Shandong 250101, China

Source: Techniques of Automation and Applications (《自动化技术与应用》), 2022, No. 7, pp. 51-53, 72 (4 pages)

Abstract: To improve library document management, an automatic classification method for library collection documents based on random forest is proposed. The TFC weighting algorithm is used to extract document features and compute the weight of each feature. Classification decision trees are then constructed, and a post-pruning algorithm is used to control the accuracy of the initial document classification. The decision trees are combined into a document classifier, and a margin function is applied to complete the random forest document classification algorithm. Experimental results show that the method achieves high classification accuracy and effectively improves the classification speedup ratio and parallel classification performance.

Keywords: decision tree; library management; text classification; random forest; pruning algorithm; speedup ratio

Classification code: TP309.1 [Automation and Computer Technology: Computer System Architecture]
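
The abstract names two computations without giving their formulas: TFC feature weighting and the margin function used with the forest's votes. The sketch below illustrates both under stated assumptions. It assumes TFC means cosine-normalized TF-IDF, as the term is commonly used in text categorization, and that the margin function follows the usual random forest definition (vote share of the true class minus the largest vote share of any other class). The function names `tfc_weights` and `margin` are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def tfc_weights(docs):
    """Cosine-normalized TF-IDF weights (assumed meaning of "TFC").

    docs: list of tokenized documents (lists of terms).
    Returns one {term: weight} dict per document.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weighted = []
    for doc in docs:
        tf = Counter(doc)
        # Raw weight: term frequency times inverse document frequency.
        raw = {t: f * math.log(n_docs / df[t]) for t, f in tf.items()}
        # Normalize each document vector to unit length.
        norm = math.sqrt(sum(w * w for w in raw.values()))
        weighted.append({t: (w / norm if norm else 0.0) for t, w in raw.items()})
    return weighted


def margin(votes, true_label):
    """Assumed margin: share of trees voting for the true class
    minus the largest share voting for any single other class."""
    counts = Counter(votes)
    total = len(votes)
    right = counts[true_label] / total
    wrong = max((c for lbl, c in counts.items() if lbl != true_label), default=0) / total
    return right - wrong


# Toy corpus for illustration only; not data from the paper.
docs = [["forest", "tree", "tree"], ["forest", "library"], ["library", "catalog"]]
weights = tfc_weights(docs)
```

A positive margin means the forest's votes favor the correct class; a negative margin means some wrong class received more votes. In the toy corpus, "tree" outweighs "forest" in the first document because it is both more frequent there and rarer across the corpus.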

 
