混洗差分隐私保护的度分布直方图发布算法  被引量:1

Histogram publishing algorithm for degree distribution via shuffled differential privacy

在线阅读下载全文

作  者:丁红发[1,2,3] 傅培旺 彭长根 龙士工[2] 吴宁博[1] DING Hongfa;FU Peiwang;PENG Changgen;LONG Shigong;WU Ningbo(College of Information,Guizhou Key Laboratory of Big Data Statistical Analysis,Guizhou University of Finance and Economics,Guiyang 550025,China;State Key Laboratory of Pubic Big Data,Guizhou University,Guiyang 550025,China;Guian Science and Technology Industry Development Co.,Ltd.,Guiyang 550025,China)

机构地区:[1]贵州财经大学信息学院贵州省大数据统计分析重点实验室,贵州贵阳550025 [2]贵州大学公共大数据国家重点实验室,贵州贵阳550025 [3]贵安新区科创产业发展有限公司,贵州贵阳550025

出  处:《西安电子科技大学学报》2023年第6期219-236,共18页Journal of Xidian University

基  金:国家自然科学基金(62002080);贵州财经大学校级项目(2021KYYB14)。

摘  要:当前,基于中心化或本地差分隐私的图数据度分布直方图发布算法无法有效平衡发布数据的隐私保护程度及其可用性,且不能有效保护用户的身份隐私。针对该问题,在编码-混洗-分析框架下提出一种混洗差分隐私保护的度分布直方图发布算法。首先,设计混洗差分隐私图数据度分布直方图隐私保护框架,采取交互式用户分组、混洗器及方波本地加噪扰动机制降低编码器对分布式用户本地差分隐私加噪的噪声影响,并利用极大似然估计在分析器端对加噪后的度分布直方图进行数据矫正,从而提高数据效用;其次,提出具体的分布式用户分组、混洗差分隐私加噪和数据矫正算法,并证明其满足(ε,σ)-混洗差分隐私。实验和对比结果表明,所提算法能保护分布式用户隐私,在L_(1)距离、H距离和MSE多个指标度量下的数据效用比已有算法提升了26%以上,且具有较低的时间开销和稳定的数据效用表现,适用不同规模的图数据度分布直方图发布共享应用。At present,the existing histogram publishing algorithms based on centralized or local differential privacy for graph data degree distribution can neither balance the privacy and utility of published data,nor preserve the identity privacy of end users.To solve this problem,a histogram publishing algorithm for degree distribution via shuffled differential privacy(SDP)is proposed under the framework of Encode-Shuffle-Analyze.First,a privacy preserving framework for histogram publishing of degree distribution is designed based on shuffled differential privacy.In this framework,the noisy impact that the encoder brings to distributed users is reduced by employing interactive user grouping,the shuffler and the square wave noise mechanism,while adding noise via local differential privacy.The noisy histogram of degree distribution is reconciled via the maximum likelihood estimation at the analyzer end,thus improving the utility of published data.Second,specific algorithms are proposed for concreting distributed user grouping,adding shuffled differential privacy noise and reconciling the noisy data,respectively.Furthermore,it is proved that these algorithms meet the requirement of(ε,σ)-SDP.Experiments and comparisons illustrate that the proposed algorithms can preserve the privacy of distributed users,and that the data utility is improved more than 26%with metrics in terms of L_(1) distance,H distance and MSE in comparison with the existing related algorithms.The proposed algorithms also perform with a low overhead and stable data utility,and are suitable for publishing and sharing the histogram of degree distribution for different scales of graph data.

关 键 词:隐私保护技术 图结构 混洗差分隐私 度分布直方图发布 数据效用 

分 类 号:TN918[电子电信—通信与信息系统] TP309[电子电信—信息与通信工程]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象