An l-Diversity Anonymity Method Based on Clustering

Source: Journal of Yanshan University (《燕山大学学报》), 2012, No. 1, pp. 32-38 (7 pages)

Funding: Supported by the Natural Science Foundation of Hebei Province (F2011203219)

Abstract: The l-diversity model traditionally generalizes data using a concept-hierarchy structure, which often causes unnecessary information loss when sensitive attributes are anonymized. To address this problem, clustering is introduced into data anonymization and a clustering-based l-diversity anonymity protection method is proposed. Subject to the constraints of the l-diversity model, the method partitions tuples with a distance-based hierarchical clustering algorithm, applies different generalization strategies to different types of quasi-identifiers, and describes the information loss caused by generalization in terms of the change in the uncertainty of attribute values before and after generalization. Compared with the existing l-diversity model, the method protects users' sensitive attributes well and reduces, to some degree, the information loss caused by generalization.
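
The pipeline summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm, which this record does not give. Everything here is an assumption made for illustration: the toy records, the mixed distance (normalized numeric gap plus a 0/1 categorical mismatch), average linkage, distinct l-diversity as the constraint, and Shannon entropy of the original values inside each cluster as the uncertainty-based loss measure.

```python
import math
from collections import Counter

# Hypothetical toy records: (age, zip_prefix, disease). The first two fields
# are quasi-identifiers; the last one is the sensitive attribute.
RECORDS = [
    (25, "066", "flu"),
    (27, "066", "cancer"),
    (26, "067", "flu"),
    (52, "100", "asthma"),
    (55, "100", "flu"),
    (53, "101", "cancer"),
]
AGE_RANGE = 30  # assumed normalizer: max age - min age in RECORDS


def distance(a, b):
    """Assumed mixed distance: normalized age gap plus 0/1 zip mismatch."""
    return abs(a[0] - b[0]) / AGE_RANGE + (0.0 if a[1] == b[1] else 1.0)


def cluster_distance(c1, c2):
    """Average-linkage distance between two clusters."""
    return sum(distance(a, b) for a in c1 for b in c2) / (len(c1) * len(c2))


def is_l_diverse(cluster, l):
    """Distinct l-diversity: at least l different sensitive values."""
    return len({r[2] for r in cluster}) >= l


def anonymize(records, l):
    """Merge each non-diverse cluster into its nearest neighbour until all
    clusters are l-diverse (or only one cluster remains)."""
    clusters = [[r] for r in records]
    while True:
        deficient = [c for c in clusters if not is_l_diverse(c, l)]
        if not deficient or len(clusters) == 1:
            return clusters
        src = deficient[0]
        others = [c for c in clusters if c is not src]
        nearest = min(others, key=lambda c: cluster_distance(src, c))
        clusters = [c for c in others if c is not nearest] + [src + nearest]


def generalize(cluster):
    """Type-specific generalization: numeric age becomes an interval,
    categorical zip prefix becomes the set of values in the cluster."""
    ages = [r[0] for r in cluster]
    zips = sorted({r[1] for r in cluster})
    return (min(ages), max(ages)), zips


def entropy(values):
    """Shannon entropy (bits) of the empirical distribution of values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_loss(clusters):
    """Assumed loss: uncertainty about each tuple's original quasi-identifier
    values after generalization, summed over all tuples."""
    return sum(
        len(c) * (entropy([r[0] for r in c]) + entropy([r[1] for r in c]))
        for c in clusters
    )


if __name__ == "__main__":
    result = anonymize(RECORDS, l=2)
    for c in result:
        print(generalize(c), [r[2] for r in c])
    print("information loss (bits):", round(information_loss(result), 3))
```

Because clusters are formed from the data rather than from a fixed concept hierarchy, tuples that are close in their quasi-identifiers end up generalized together, which is the intuition behind the reduced information loss claimed in the abstract.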

Keywords: anonymity protection; data generalization; information loss; clustering; l-diversity

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]
