检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:单芝慧 韩萌[1] 韩强[1] SHAN Zhihui;HAN Meng;HAN Qiang(School of Computer Science and Engineering,North Minzu University,Yinchuan Ningxia 750021,China)
机构地区:[1]北方民族大学计算机科学与工程学院,银川750021
出 处:《计算机应用》2023年第7期2049-2056,共8页journal of Computer Applications
基 金:国家自然科学基金资助项目(62062004,61862001);宁夏自然科学基金资助项目(2020AAC03216)。
摘 要:高效用项集(HUI)挖掘能够提供数据集中高利润的项的组合信息,有利于在现实应用中制定有效的营销策略。然而,HUI仅提供项集及其总效用,不提供单个项的购买数量,而现实场景中项的数量能提供更精准的信息。因此,研究者提出定量高效用项集(HUQI)挖掘算法。针对当前的HUQI挖掘算法仅能处理静态数据且存在结果集冗余的问题,提出增量更新的定量效用列表结构来存储并更新数据集中项的效用信息,并基于该结构提出一种挖掘闭合定量高效用项集(CHUQI)的算法。将所提出的算法与FHUQI-Miner(Faster High Utility Quantitative Itemset Miner)算法在结果集数量、最小效用阈值、批次数目以及可扩展性上对比时间与内存消耗。实验结果表明,所提算法能够有效处理增量数据,挖掘出更有趣的项集。High Utility Itemset(HUI)mining can provide information about the combination of highly profitable items in a dataset,which is useful for developing effective marketing strategies in real-world applications.However,HUIs only provide the itemsets and their total utility,not the purchased numbers of individual items,and the numbers of items in a real scenarios provide more precise accurate information.Therefore,High Utility Quantitative Itemset(HUQI)mining algorithms have been proposed by researchers.Focusing on the issue that the current HUQI mining algorithms can only process static data and have the problem of redundant resultsets,an incrementally updated quantitative utility list structure was proposed for storing and updating the utility information of items in the dataset,and based on this structure,an algorithm for mining Closed High Utility Quantitative Itemset(CHUQI)was proposed.The time and memory consumption of the proposed algorithm was compared with that of Faster High Utility Quantitative Itemset Miner(FHUQI-Miner)algorithm in terms of the number of result sets,minimum utility threshold,number of batches,and scalability.Experimental results show that the proposed algorithm can process incremental data effectively and mine more interesting itemsets.
关 键 词:增量挖掘 高效用项集 定量高效用项集 闭合高效用项集 效用列表
分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.222.252.132