检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:彭长根 赵园园[1,3] 樊玫玫 PENG Changgen;ZHAO Yuanyuan;FAN Meimei(State Key Laboratory of Public Big Data,College of Mathematics and Statistics,Guizhou University,Guiyang 550025,China;College of Computer Science and Technology,Guizhou University,Guiyang 550025,China;Institute of Cryptography&Data Security,Guzihou University,Guiyang 550025,China)
机构地区:[1]贵州大学数学与统计学院公共大数据国家重点实验室,贵阳550025 [2]贵州大学计算机科学与技术学院,贵阳550025 [3]贵州大学密码学与数据安全研究所,贵阳550025
出 处:《信息网络安全》2020年第2期37-48,共12页Netinfo Security
基 金:国家自然科学基金[U1836205,61662009,61772008,11761020];贵州省科技重大专项[[2018]3001,[2018]3007,[2017]3002];贵州省科技支撑计划[[2019]2004,[2018]2162];贵州省科技基础研究计划[[2017]1045,[2019]1049];贵州财经大学科研基金[2017XJC01];贵州大学研究生创新基金[研理工2016068];贵州省研究生科研基金立项课题[KYJJ2017005]。
摘 要:数据的私密性与可用性是隐私保护领域的核心问题。主成分分析差分隐私算法将高维数据的主成分降维与加噪融合,可有效对高维数据进行隐私保护并保持数据的高可用性。现有的主成分分析差分隐私保护算法依赖于皮尔逊相关系数,在隐私保护过程中仅能捕获高维数据的线性关系,且未考虑将差分预算在降维后的数据集上进行优化分配,导致算法普适性不足、数据效用不高。针对此问题,文章提出一种基于最大信息系数的主成分分析差分隐私(MIC-PCA-DP)数据发布算法。实验结果表明,文章所提出的隐私保护算法适用于线性关系、非线性关系、非函数依赖关系等数据关系的降维保持,其主成分降维可实现高效的原始数据信息承载;与经典的差分隐私算法以及PCA-basedPPDP算法相比,在相同隐私保护强度下,对数据添加的噪声更小,能够有效保持数据的可用性。The privacy and availability of data are important issues of privacy protection.Principal component analysis(PCA)differential privacy can effectively protect the privacy of high-dimensional data and maintain the high availability of data by integrating the dimensionality reduction and noise addition of the principal component of high-dimensional data.The existing principal component analysis differential privacy protection algorithm relies on Pearson correlation coefficient,which can only capture the linear relationship of high-dimensional data in the privacy protection process,and does not consider to optimize the allocation of difference budget on the data set after dimension reduction,resulting in insufficient applicability of the algorithm and low data utility.To this end,a differential private data publishing algorithm(MICPCA-DPPD)is proposed via principal component analysis based on maximum information coefficient.The experimental results show that the proposed privacy protection algorithm is suitable for dimensionality maintenance of linear relationships,nonlinear relationships,multifunction relationships,etc.Its principal component dimensionality reduction can achieve efficient raw data information bearing.Compared with the classic differential privacy algorithm and PCAbased PPDP algorithm,the noise added to the data is smaller and the data availability can be effectively maintained,under the same constraint of privacy protection intensity.
分 类 号:TP309[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.173