检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《计算机工程与应用》2007年第7期188-190,197,共4页Computer Engineering and Applications
摘 要:针对Apriori和AprioriTid算法中存在的项集生成瓶颈问题,提出了一种基于事务集压缩、候选项集压缩和支持度布尔矩阵的改进AprioriTid算法。该算法中通过删去不必比较的事务来有效缩减数据集;优化频繁项集的自连接方式来减少生成的候选项集个数;使用支持度布尔矩阵来加快候选项集的验证速度。实验结果表明改进算法确实能有效减少相关计算量,比已有算法执行效率明显提高,同时验证了该算法在旋转机械故障诊断中的有效性。The efficiency of mining association rules is an important field of Knowledge Discovery in Databases.In this paper we have proposed an improved AprioriTid algorithm with transactions reduction,candidate itemsets reduction and support matrix to solve the bottleneck of itemsets generation.The highly efficient method described in this paper minimizes the database by deleting many transactions which need not be scanned.We also show a method to reduce the number of candidate itemsets by optimizing the join procedure of frequent itemsets and a support matrix method to accelerate the verification speed of candidate itemsets is put forward.To this end,the IAT algorithm for mining frequent itemsets,which is the improvement algorithm of AprioriTid,is designed in this article.The experiment results of the algorithm show that the improved algorithm can decrease related computation quantity in large scale and improve the efficiency of the algorithm.The simulation results of knowledge acquisition for fault diagnosis also show the validity of IAT algorithm.
关 键 词:数据挖掘 关联规则 APRIORITID算法 频繁项集
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.30