Most frequent itemset mining algorithm based on improved inverted list and set theory (cited by: 1)

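Affiliations: [1] Department of Computer Science and Technology, Nanyang Institute of Technology, Nanyang, Henan 473004; [2] School of Computer and Communication Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002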

Source: Application Research of Computers (《计算机应用研究》), 2012, No. 6, pp. 2135-2137 (3 pages)

Funding: Natural Science Research Guidance Program of the Education Department of Henan Province (2010C520007)

Abstract: Mining most frequent itemsets is a central and difficult problem in text association rule mining, and it directly determines the performance of text association rule mining algorithms. To address shortcomings of existing most frequent itemset mining algorithms, this paper introduces set theory into the traditional inverted list to improve it, and on that basis states several propositions and corollaries that are used to improve performance. Combining these with a strategy that dynamically adjusts the minimum support threshold, it proposes a most frequent itemset mining algorithm based on the improved inverted list and set theory. Experimental results show that the proposed algorithm outperforms NApriori and IntvMatrix, two commonly used most frequent itemset mining algorithms, in both the effective rate of rules and time performance.
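The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea it names: an inverted list whose entries are sets of document IDs, so that the support of an itemset is the size of a set intersection. The function names, the level-wise search, and the threshold-lowering rule are all assumptions made for illustration, not the authors' method or their propositions.

```python
from itertools import combinations


def build_inverted_list(docs):
    """Map each term to the set of document IDs that contain it."""
    inverted = {}
    for doc_id, terms in enumerate(docs):
        for term in set(terms):
            inverted.setdefault(term, set()).add(doc_id)
    return inverted


def frequent_itemsets(docs, min_support):
    """Level-wise search (assumed here, not taken from the paper).

    The support of an itemset is the size of the intersection of the
    document-ID sets of its terms.
    """
    inverted = build_inverted_list(docs)
    # Level 1: single terms whose document-ID set is large enough.
    current = {
        frozenset([term]): ids
        for term, ids in inverted.items()
        if len(ids) >= min_support
    }
    result = {}
    while current:
        result.update({items: len(ids) for items, ids in current.items()})
        next_level = {}
        # Join pairs of k-itemsets that differ in exactly one term.
        for (a, ids_a), (b, ids_b) in combinations(current.items(), 2):
            candidate = a | b
            if len(candidate) != len(a) + 1 or candidate in next_level:
                continue
            ids = ids_a & ids_b  # documents containing every term
            if len(ids) >= min_support:
                next_level[candidate] = ids
        current = next_level
    return result


def mine_with_adjusted_threshold(docs, start, min_found):
    """Hypothetical adjustment rule: lower the threshold one step at a
    time until at least min_found itemsets of two or more terms appear."""
    threshold = start
    while threshold >= 1:
        found = {
            items: support
            for items, support in frequent_itemsets(docs, threshold).items()
            if len(items) >= 2
        }
        if len(found) >= min_found:
            return threshold, found
        threshold -= 1
    return 0, {}
```

Storing document IDs as sets keeps support counting to one intersection per candidate, with no rescan of the documents. Whether the paper's improved inverted list works this way cannot be confirmed from the abstract alone.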

Keywords: most frequent itemset; text association rules; inverted list; set theory

Classification: TP301 [Automation and Computer Technology: Computer System Architecture]

