面向数据流的频繁模式挖掘研究被引量：1

Research on mining frequent patterns in data streams

作　　者：孟彩霞[1]

出　　处：《计算机应用研究》2009年第11期4054-4056,4059,共4页Application Research of Computers

基　　金：陕西省自然科学基金资助项目(2004f283);西安市科技创新支撑-应用发展研究计划资助项目(YF07024)

摘　　要：数据流的无限性、高速性使得经典的频繁模式挖掘方法难以适用到数据流中。针对数据流的特点,对数据流中频繁模式挖掘问题进行了研究,提出了数据流频繁模式挖掘算法FP-SegCount。该算法将数据流分段并利用改进的FP-growth算法挖掘分段中的频繁项集,然后利用Count-Min Sketch进行项集计数。算法解决了压缩统计和计算快速高效的问题。通过实验分析,FP-SegCount算法是有效的。Because of the limitless and high speed of data stream, classical frequent patterns mining method is difficult to extend to data stream. According to the characteristic of data streams, this paper proposed FP-SegCount algorithm for mining frequent patterns from data streams. The algorithm partitioned the data stream and used modified FP-growth algorithm to mining frequent itemsets in every segment. And then, counted itemsets in Count-Min Sketch. The algorithm solved the problem of compressed statistic and effective computation. Through＂ experimentation and analysis, FP-SegCount algorithm is efficient.

关键词：数据流数据挖掘数据流挖掘频繁模式

分类号：TP311[自动化与计算机技术—计算机软件与理论]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

面向数据流的频繁模式挖掘研究被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

面向数据流的频繁模式挖掘研究 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

面向数据流的频繁模式挖掘研究被引量：1