保护私有信息的统计量化规则挖掘(英文)  

Privacy-preserving Statistical Quantitative rules mining

在线阅读下载全文

作  者:荆巍巍[1] 黄刘生[2] 姚亦飞[1] 徐维江[1] 

机构地区:[1]中国科学技术大学计算机科学与技术系,合肥230027 [2]国家高性能计算中心(合肥),合肥230027

出  处:《中国科学院研究生院学报》2008年第6期771-780,共10页Journal of the Graduate School of the Chinese Academy of Sciences

基  金:the National Natural Science Foundation(60703071,60773032);the Science Foundation of Jiangsu Province(BK2007060);the Science Foundation of Anhui Province(070412043)

摘  要:统计量化规则(SQrule)在数据挖掘中拥有重要和有用的地位.尽管集中式挖掘SQ规则的算法已经存在,但是集中式算法不能简单应用到分布式环境中,尤其涉及到分布式环境中各方的私有信息保护的时候.考虑数据分布共享的多方,在不泄漏各自的私有信息的情况下,合作完成SQ规则的挖掘问题.该问题属于保护私有信息的数据挖掘(PPDM)研究领域的问题.基于3个PPDM的基本工具,包括安全求和、安全求平均和安全求频繁项集的集合等,提交2个算法,共同完成水平划分数据下的保护私有信息的SQ规则挖掘.其中,一个算法安全计算置信区间,该区间用来检验规则的重要性;另一个算法安全挖掘规则.最后,给出算法的正确性、安全性和复杂性分析.Statistical Quantitative (SQ) rule plays an important and useful role in data mining. Centralized algorithms have been presented for SQ rules mining. However, the algorithms cannot be easily applied to mining SQ rules on distributed data, where privacy of parties becomes great concerns. This paper considers the problem of mining SQ rules without revealing the private information of parties who compute jointly and share distributed data. The issue is an area of Privacy-Preserving Data Mining (PPDM) research. Based on several basic tools for PPDM, including secure sum, secure mean and secure frequent itemsets, this paper presents two algorithms to accomplish privacy-preserving SQ rules mining over horizontally partitioned data. One is to securely compute confidence intervals for testing the significance of rules ; the other is to securely discover SQ rules. Besides, the analysis of the correctness, the security and the complexity of our algorithms are provided.

关 键 词:安全多方计算 保护私有信息的数据挖掘 统计量化规则 

分 类 号:TP309[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象