基于分布式多关联属性的高维数据差分隐私保护方法  

Differential privacy protection method of multi-associated attribute based on distributed high dimensional data

在线阅读下载全文

作  者:褚治广 李俊燕[2] 陈昊 张兴 CHU Zhi-guang;LI Jun-yan;CHEN Hao;ZHANG Xing(Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China;Key Laboratory of Security for Network and Data in Industrial Internet of Liaoning Province,Liaoning University of Technology,Jinzhou 121001,China)

机构地区:[1]北京工业大学信息学部,北京100124 [2]辽宁工业大学辽宁省工业互联网网络与数据安全重点实验室,辽宁锦州121001

出  处:《计算机工程与设计》2024年第4期967-973,共7页Computer Engineering and Design

基  金:国家自然科学基金项目(61802161);辽宁省教育厅科学研究基金项目(JZL202015404、LJKZ0625)。

摘  要:针对高维数据发布的过程中存在由多关联属性引发的隐私信息泄露风险问题,在分布式环境下提出一种满足差分隐私保护的多关联属性高维数据发布方法(HDMPDP)。根据数据维度,提出一种基于分布式划分的粗糙集高效降维方法,完成对高维复杂数据特征属性的划分,降低数据维度的同时提高处理效率;设计属性分类准则,利用属性信息熵改进关联分析方法;对得到的属性分别进行加噪,优化噪声添加的方式,减轻关联属性带来的隐私问题。在Spark分布式框架下实现隐私保护数据发布,通过高维数据实验验证了该方法的有效性和隐私保护的安全性。To solve the problem of privacy information leakage caused by multi associated attributes in the publishing process of high-dimensional data sets,a multi associated attribute high-dimensional data privacy protection method(HDMPDP)was proposed in distributed environment.According to the data dimension,an efficient dimensionality reduction method of rough set based on distributed partition was proposed,to complete the division of high-dimensional complex data feature attributes,reduce the data dimension and improve the processing efficiency.The attribute classification criterion was designed,and the attribute information entropy was used.The associated analysis method was improved.The noise was added to the obtained attributes respectively,the way of adding noise was optimized,and the privacy problem caused by associated attributes was alleviated.The privacy-preserving data release was realized under the Spark distributed framework,and the effectiveness of the method and the security of privacy-preserving were verified through high-dimensional data experiments.

关 键 词:高维数据 多关联属性 差分隐私 分布式 关联分析 粗糙集 隐私保护 

分 类 号:TP309.2[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象