Source: Journal of Computer Applications (《计算机应用》), 2010, No. 7, pp. 1922-1925 (4 pages)
Funding: Supported by the Natural Science Foundation of Jiangsu Province (BK20003017)
Abstract: Algorithms that mine maximal frequent patterns from an FP-tree are among the more efficient frequent pattern mining algorithms currently available, but they must recursively generate conditional FP-trees and perform superset checking. To address these problems, and building on an analysis of the DMFIA-1 algorithm, this paper proposes NCMFP (Non-Check Mining algorithm of Maximum Frequent Pattern), an algorithm for mining maximal frequent patterns without superset checking. The algorithm modifies the structure of the FP-tree so that mining requires neither the generation of conditional frequent pattern trees nor superset checking. Its predictive pruning strategy reduces the number of mining operations, and its method of computing common intersections ensures that the mining result is complete. Experimental results show that, when the support threshold is relatively small, NCMFP is 2 to 5 times as efficient as comparable algorithms.
Keywords: association rules; data mining; frequent pattern tree; maximal frequent itemset; superset checking
Classification number: TP311.13 [Automation and Computer Technology: Computer Software and Theory]
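
Illustrative note: the abstract refers to maximal frequent patterns and to the superset checking that NCMFP is designed to avoid. The sketch below is not NCMFP and uses no FP-tree. It is a brute-force illustration, on a tiny hypothetical transaction set, of what a maximal frequent itemset is and what a naive superset check does. The function names, the sample data, and the use of an absolute support count are assumptions made for this example only.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Enumerate all frequent itemsets by brute force (tiny inputs only)."""
    items = sorted(set().union(*transactions))
    frequent = []
    for size in range(1, len(items) + 1):
        for candidate in combinations(items, size):
            cand = frozenset(candidate)
            # Support = number of transactions that contain the candidate.
            support = sum(1 for t in transactions if cand <= t)
            if support >= min_support:
                frequent.append(cand)
    return frequent


def maximal_frequent_itemsets(transactions, min_support):
    """Keep only frequent itemsets that have no frequent proper superset."""
    frequent = frequent_itemsets(transactions, min_support)
    # Naive superset check: compare every itemset against every other one.
    return [s for s in frequent if not any(s < other for other in frequent)]


# Hypothetical sample data, not taken from the paper.
transactions = [
    frozenset(t)
    for t in ({"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"})
]
result = maximal_frequent_itemsets(transactions, min_support=2)
```

With a minimum support count of 2, every subset of {a, b, c} is frequent, so the only maximal frequent itemset is {a, b, c}. Raising the threshold to 3 makes the three pairs maximal. The quadratic comparison in the last step is the kind of cost that, according to the abstract, NCMFP avoids.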