检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:姜苗[1,2] 倪志伟[1,2] 孟金华[1,2] 周之强[1,2]
机构地区:[1]合肥工业大学管理学院,安徽合肥230009 [2]过程优化与智能决策教育部重点实验室,安徽合肥230009
出 处:《中国科学技术大学学报》2011年第8期739-745,共7页JUSTC
基 金:国家自然科学基金(70871033);中国高技术研究发展(863)计划(2007AA04Z116)资助
摘 要:在数据流闭频繁项集挖掘过程中,常忽略历史模式对挖掘结果的影响,并采用一种结构来标记闭频繁项集的类型,导致算法的效率不高.为此提出一种挖掘数据流时间窗口中闭频繁项集的方法NEWT-moment.该方法能在单遍扫描数据流事务的条件下完整地记录模式信息.同时,NEWT-moment提出的剪枝方法能很好地降低滑动窗口树F-tree的空间复杂度与闭频繁模式树NEWT-tree的维护代价.此外,该方法提出的时间衰减机制能区分历史和最新模式对挖掘结果的影响;并且,NEWT-tree直接存储闭频繁项集,可随时快速读取闭频繁项集.与T-moment算法相比,算法不需要删除历史数据,不需要记录事务时标,标记各节点,降低了算法的时间和空间复杂度.大量实验结果表明,NEWT-moment有很好的效率和准确性.When mining closed frequent itemsets over data streams,the available algorithms are often made inefficient due to the fact that they often ignore mode decaying as time passes,and adopt a structure to mark the types of closed frequent itemsets.A method was proposed for mining the closed frequent patterns in the time window of data streams.The pattern of data streams could be completely recorded by scanning the streams only once.And the pruning method of NEWT-moment could reduce the space complexity of sliding window tree and the maintenance cost of the closed frequent patterns tree.To differentiate the historical and the latest patterns,a time decaying model was applied.Additionally,NEWT-tree stores the closed frequent itemsets directly,so they can be read quickly.In contrast with T-moment,and NEWT-moment does not need to delete the historical data,or mark transaction and nodes,which can decrease the time complexity and the space complexity.The experimental results show that the algorithm has good efficiency and accuracy.
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249