(k,l)-多样性数据发布研究被引量：4

Achieving(k,l)-Diversity in Privacy Preserving Data Publishing

机构地区：[1]安徽理工大学计算机科学与工程学院,淮南232001 [2]哈尔滨工程大学计算机科学与技术学院,哈尔滨150001

出　　处：《计算机科学》2013年第8期140-145,共6页Computer Science

基　　金：国家自然科学基金(61073043;61170060);安徽高等学校省级自然科学基金(KJ2011Z098)资助

摘　　要：发布未经处理的数据会导致身份泄露和敏感属性泄露,通过概化准标识符可以达到隐私保护的目的,但信息损失过大。针对该问题提出一种基于聚类的(k,l)-多样性数据发布模型并设计算法予以实现。通过使用概率联合分布度量数据对象的离散属性和连续属性相似性,提高了数据的效用。详细论述了簇的合并、调整和概化策略,结合参数k和l提出隐私保护度概念,指出了基于聚类的最优化(k,l)-多样性算法是NP-难问题,并分析了算法的复杂度。理论分析和实验结果表明,该方法可以有效减少执行时间和信息损失,提高查询精度。In order to avoid disclosure of individual identity and sensitive attribute,reduce the information loss when da- ta release, a clustering-based algorithm to achieve（k, l）-diversity（CBAD）in data publishing was presented. The discrete attributes and continuous attributes mixed in the data set were fully taken into account while clustering. The probability distribution was used as metrics to measure similarity between the data objects. We solved the confusion of the informa- tion loss and the distance between data objects, pointed out that the clustering-based optimization（k,/）-diversity algo- rithrn is NP-hard problem, proposed the concept of privacy protection degree with parameter k and l, and analysed the complexity of the algorithm. Theoretical analysis and experimental results show that the method can effectively reduce the execution time and information loss, improve query precision.

关键词：隐私保护数据发布 l-多样性数据效用聚类相似性度量

分类号：TP309.2[自动化与计算机技术—计算机系统结构]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

(k,l)-多样性数据发布研究被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

(k,l)-多样性数据发布研究 被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

(k,l)-多样性数据发布研究被引量：4