Authors: Yun-yao SHENG (盛昀瑶) [1]; Yang SHEN (沈阳)
Affiliations: [1] School of Information Engineering, Changzhou Vocational Institute of Mechatronic Technology, Changzhou 213164, Jiangsu, China; [2] Institute of Computer Systems, South China University of Technology, Guangzhou 510006, China
Source: Machine Tool & Hydraulics (《机床与液压》), 2018, No. 18, pp. 186-192 (7 pages)
Abstract: Traditional data mining algorithms for heterogeneous networks cluster data on the basis of the correlation between data items. When a data set contains a large amount of redundant data, that correlation weakens and mining accuracy falls. To address this problem, a new mining algorithm for weakly associated, redundant environments is proposed. The algorithm first applies a data clustering method to determine the initial cluster centers of the big data set, then repeatedly updates those centers so that they approach the true centers, which yields a clustering of the big data set. It then mines the weak association rules of the data set, computes the association between data items under those rules, and uses a weakened association rule method to mine data in the weakly associated, redundant environment. Experimental results show that the proposed algorithm achieves higher mining efficiency and accuracy, as well as lower complexity.
Classification: TP311 [Automation and Computer Technology - Computer Software and Theory]
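
Note: the abstract describes a first stage in which initial cluster centers are chosen and then updated until they approach the true centers. The record does not give the paper's update rule, so the sketch below substitutes a plain Lloyd-style (k-means) iteration as a stand-in; the function name `update_centers` and its parameters `max_iter` and `tol` are hypothetical and are not taken from the paper.

```python
from math import dist


def update_centers(points, centers, max_iter=100, tol=1e-9):
    """Lloyd-style iteration (an assumed stand-in, not the paper's method).

    Repeatedly assigns each point to its nearest center, then moves each
    center to the mean of the points assigned to it, until the centers
    stop moving by more than `tol` or `max_iter` rounds have run.
    """
    centers = [tuple(c) for c in centers]
    clusters = [[] for _ in centers]
    for _ in range(max_iter):
        # Assignment step: group points by nearest current center.
        clusters = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: dist(p, centers[i]))
            clusters[nearest].append(p)

        # Update step: move each center to the mean of its cluster.
        new_centers = []
        for old, members in zip(centers, clusters):
            if members:
                mean = tuple(sum(axis) / len(members) for axis in zip(*members))
                new_centers.append(mean)
            else:
                new_centers.append(old)  # keep a center that attracted no points

        shift = max(dist(a, b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift <= tol:
            break
    return centers, clusters


# Two well-separated groups; the initial centers are deliberately off-center.
points = [(0, 0), (0, 1), (10, 10), (10, 11)]
centers, clusters = update_centers(points, [(0, 0), (10, 10)])
```

The second stage named in the abstract, weak association rule mining, is not sketched here because the record gives no definition of the rule or of its association measure.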