Authors: LIU Shi-jin (刘诗瑾) [1,2]; YANG Zhi-ling (杨知玲)
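Affiliations: [1] Zhujiang College of South China Agricultural University, Guangzhou, Guangdong 510600, China; [2] Wuhan University, Wuhan, Hubei 430072, China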
Source: Computer Simulation (《计算机仿真》), 2024, No. 8, pp. 513-516, 534 (5 pages)
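Funding: Guangdong Provincial Department of Education, Undergraduate University Teaching Quality and Teaching Reform Project (粤教高函 [2023] No. 4-1084).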
Abstract: Multi-source heterogeneous data can come from different domains, in different formats, and from sources of varying quality, which makes it hard to process. To address the difficulty of mining such data accurately, this paper proposes a multi-source heterogeneous data mining algorithm based on decision tree classification. A decision tree is built to partition the data attributes, and the initial tree is pruned to obtain the attribute set of the multi-source heterogeneous data. Data factors are then extracted, which yields a coarse mining result. Next, a deep learning algorithm mines the multi-source heterogeneous data that remains in the rest of the data, and a second round of mining is performed on the original dataset. The coarse and fine results are integrated to produce the final mining result. Experimental results show that the proposed algorithm achieves a high F1 value and a low generalization error, indicating strong data mining performance.
Keywords: decision tree; data classification; multi-source heterogeneous data; data mining; deep learning algorithm
Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
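
The abstract reports the F1 value as its main quality measure and says the coarse and fine mining results are integrated, but this record gives neither the integration rule nor any data. The sketch below is only an illustration under stated assumptions: integration is modeled as a plain set union, the record IDs are invented, and `f1_score` is a hypothetical helper using the standard definition of F1 as the harmonic mean of precision and recall. It is not the authors' implementation.

```python
def f1_score(predicted: set, relevant: set) -> float:
    """Standard F1: harmonic mean of precision and recall over item IDs."""
    true_positives = len(predicted & relevant)
    if true_positives == 0:
        return 0.0
    precision = true_positives / len(predicted)
    recall = true_positives / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Hypothetical record IDs, invented purely for illustration.
relevant = {1, 2, 3, 4, 5, 6}   # items that should be mined
coarse = {1, 2, 3, 9}           # stand-in for the decision-tree (coarse) stage
fine = {4, 5}                   # stand-in for the second (fine) stage

# Assumption: the two stages are integrated by set union.
merged = coarse | fine

f1_coarse = f1_score(coarse, relevant)
f1_merged = f1_score(merged, relevant)
print(f"coarse only: {f1_coarse:.3f}, coarse + fine: {f1_merged:.3f}")
```

In this toy setup the second stage recovers items the first stage missed, so recall rises and the merged F1 is higher than the coarse-only F1. Whether that holds on real data depends on how many false positives the second stage adds.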