不确定大数据流分类的决策树模型构建仿真  被引量:1

Simulation of Decision Tree Model Construction for Uncertain Big Data Flow Classification

在线阅读下载全文

作  者:杨知玲[1] 谭树杰 YANG Zhi-ling;TAN Shu-jie(Zhujiang College of South China Agricultural University,Guangzhou Guangdong 510900,China;Jiangxi Science and Technology Normal University,College of Communication and Electronics,Nanchang Jiangxi 330013,China)

机构地区:[1]华南农业大学珠江学院,广东广州510900 [2]江西科技师范大学通信与电子学院,江西南昌330013

出  处:《计算机仿真》2024年第5期532-535,542,共5页Computer Simulation

基  金:2022年度广东省教育科学规划课题(高等教育专项)(2022GXJK404);2021年广东省青年创新人才类项目(2021WQNCX156);北方国际大学联盟第六期教育教学研究课题(20210608004);2021年广东省青年创新人才类项目(2021WQNCX136);2022年广东省本科高校教学质量与教学改革工程建设项目(粤教高函[2023]4号)。

摘  要:在不确定大数据流分类过程中,受噪声和孤立点的干扰,导致处理效果和分类精度无法达到预期要求。为解决上述问题,提出一种基于决策树模型的不确定大数据流分类算法。通过采用在线字典学习算法,对不确定大数据流去噪处理,消除噪声对分类过程产生的干扰。构建决策树,在剪枝过程中通过特征过滤算法,滤除不确定大数据流中掺杂的孤立点。将去噪后的不确定大数据流,输入决策树模型中,完成分类工作。实验结果表明,所提算法处理后的不确定大数据流振幅明显减小,且分类精度高,具有一定的应用价值。In the process of uncertain big data stream classification,the effect and classification accuracy are unable to meet the expected requirements due to the interference of noise and isolated points.Therefore,an algorithm of classifying uncertain big data stream based on decision tree model was proposed.First of all,the online dictionary learning algorithm was adopted to reduce the noise from uncertain big data stream,and thus to eliminate the noise interference in the classification process.Moreover,a decision tree was constructed.Furthermore,feature filtering algorithm was adopted to filter the isolated points doped in the uncertain big data stream in the pruning process.Finally,the uncertain big data stream after denoising was input into the decision tree model,thus completing the classification.The experimental results show that the amplitude of the uncertain big data stream processed by the proposed algorithm is significantly reduced.In addition,the method has high classification accuracy and application value.

关 键 词:决策树模型 在线字典学习算法 特征过滤 不确定大数据流 数据分类 

分 类 号:TP393[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象