Authors: CHEN Liu; FENG Shan [1] (College of Mathematics and Software Science, Sichuan Normal University, Chengdu, Sichuan 610068, China)
Affiliation: [1] College of Mathematics and Software Science, Sichuan Normal University, Chengdu 610068
Source: Journal of Computer Applications (《计算机应用》), 2018, No. 5, pp. 1315-1319, 1338 (6 pages)
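Funding: National Natural Science Foundation of China (61673285); Key Natural Science Fund of the Sichuan Provincial Department of Education (15ZB0029); Sichuan Youth Science and Technology Fund (2017JQ0046)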
Abstract: Traditional methods for setting confidence thresholds for positive and negative association rules struggle to limit the number of low-confidence rules and tend to miss interesting rules. To address this, a two-level confidence threshold setting method that incorporates itemset correlation, called PNMC-TWO, is proposed. First, with the consistency, validity and interestingness of rules in mind, and within a correlation-support-confidence framework, the computational relationship between rule confidence and itemset support is used to analyze systematically how the confidence of positive and negative association rules varies with the support of the rule's itemsets. Then, combining this analysis with users' need in practical mining for rules that are both highly confident and interesting, a new threshold setting model is proposed that avoids the blindness and arbitrariness of traditional threshold setting. Finally, the proposed method is compared experimentally with the original two-threshold method in terms of both the number and the quality of the rules. The experimental results show that the proposed method not only better ensures that the extracted association rules are valid and interesting, but also significantly reduces the number of low-confidence rules.
Keywords: data mining; positive and negative association rules; rule confidence threshold; itemset correlation
Classification: TP311.13 [Automation and Computer Technology: Computer Software and Theory]
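The abstract refers to the computational relationship between rule confidence and itemset support without stating it. As a rough illustration only, the sketch below uses the standard textbook definitions for the four rule forms between itemsets A and B; it is not the PNMC-TWO threshold model, which this record does not specify. The function name `rule_measures` and its parameters are hypothetical.

```python
from fractions import Fraction


def rule_measures(s_a, s_b, s_ab):
    """Correlation and confidences for the four rule forms between A and B.

    s_a, s_b: supports of itemsets A and B (each strictly between 0 and 1).
    s_ab: support of A and B occurring together.
    Standard definitions only; not the threshold model from the paper.
    """
    if not (0 < s_a < 1 and 0 < s_b < 1):
        raise ValueError("supports of A and B must lie strictly between 0 and 1")
    return {
        # corr > 1: positively correlated; corr < 1: negatively correlated
        "corr": s_ab / (s_a * s_b),
        "A=>B": s_ab / s_a,
        "A=>~B": (s_a - s_ab) / s_a,
        "~A=>B": (s_b - s_ab) / (1 - s_a),
        "~A=>~B": (1 - s_a - s_b + s_ab) / (1 - s_a),
    }


# Example with exact fractions: A in 50% of transactions, B in 40%, both in 10%.
measures = rule_measures(Fraction(1, 2), Fraction(2, 5), Fraction(1, 10))
for name, value in measures.items():
    print(name, value)
```

In this example the correlation is below 1, so A and B are negatively correlated, and the forms A=>~B and ~A=>B have higher confidence than A=>B and ~A=>~B. Every confidence is fixed by the three supports, which is why a threshold can in principle be tied to support and correlation instead of being chosen independently.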