检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]华东理工大学计算机科学与工程系,上海200237 [2]浙江师范大学数理与信息工程学院,浙江金华321004
出 处:《电子学报》2008年第10期2021-2029,共9页Acta Electronica Sinica
基 金:国家自然科学基金(No.60773094,No.60473055);上海市曙光计划(No.07SG32);上海市浦江人才计划(No.05PJ14030)
摘 要:数据表的k-匿名化(k-anonymization)是数据发布时保护私有信息的一种重要方法.泛化/隐匿是实现k-匿名的传统技术,然而,该技术存在效率低、k-匿名化后数据的可用性差等问题.近年来,微聚集(Microaggregation)算法被应用到数据表的k-匿名化上,弥补了泛化/隐匿技术的不足,其基本思想是:将大量的数据按相似程度划分为若干类,要求每个类内元组数至少为k个,然后用类质心取代类内元组的值,实现数据表的k-匿名化.本文综述了微聚集算法的基本思想、相关技术和当前动态,对现有的微聚集算法进行了分类分析,并总结了微聚集算法的评估方法,最后对微聚集算法的研究难点及未来的发展趋势作了探讨.K-anonymization of tables is a method to prevent private information from disclosure prior to publication, which is achieved traditionally via generalization/suppression techniques. However, these methods have some defects on efficiency, availability, etc. Recently, microaggregation algorithm is proposed as an alternative to generalization/suppression method for k-anonymization whose goal is to cluster a set of records into groups of size at least k such that groups are as homogeneous as possible. Then the records'attribute values in the same group are replaced by the group's centroid. Microaggregation algorithms'core ideas, the stateof-the-art and related techniques are surveyed. The existing algorithms are classified and analyzed. Evaluation methods of microaggregation algorithms are investigated. Finally, some open problems and the research directions in this area are discussed.
分 类 号:TP309.2[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.180