KSKV:Key-Strategy for Key-Value Data Collection with Local Differential Privacy  

在线阅读下载全文

作  者:Dan Zhao Yang You Chuanwen Luo Ting Chen Yang Liu 

机构地区:[1]Artificial Intelligence Development Research Center,Institute of Scientific and Technical Information of China,Beijing,100038,China [2]Industry Development Department,NSFOCUS Inc.,Beijing,China [3]School of Information Science and Technology,Beijing Forestry University,Beijing,100083,China [4]School of Computer Science and Engineering,University of Electronic Science and Technology of China,Chengdu,610054,China [5]Institute of Computing Technology,China Academy of Railway Sciences Corporation Limited,Beijing,10081,China

出  处:《Computer Modeling in Engineering & Sciences》2024年第6期3063-3083,共21页工程与科学中的计算机建模(英文)

基  金:supported by a grant fromthe National Key R&DProgram of China.

摘  要:In recent years,the research field of data collection under local differential privacy(LDP)has expanded its focus fromelementary data types to includemore complex structural data,such as set-value and graph data.However,our comprehensive review of existing literature reveals that there needs to be more studies that engage with key-value data collection.Such studies would simultaneously collect the frequencies of keys and the mean of values associated with each key.Additionally,the allocation of the privacy budget between the frequencies of keys and the means of values for each key does not yield an optimal utility tradeoff.Recognizing the importance of obtaining accurate key frequencies and mean estimations for key-value data collection,this paper presents a novel framework:the Key-Strategy Framework forKey-ValueDataCollection under LDP.Initially,theKey-StrategyUnary Encoding(KS-UE)strategy is proposed within non-interactive frameworks for the purpose of privacy budget allocation to achieve precise key frequencies;subsequently,the Key-Strategy Generalized Randomized Response(KS-GRR)strategy is introduced for interactive frameworks to enhance the efficiency of collecting frequent keys through group-anditeration methods.Both strategies are adapted for scenarios in which users possess either a single or multiple key-value pairs.Theoretically,we demonstrate that the variance of KS-UE is lower than that of existing methods.These claims are substantiated through extensive experimental evaluation on real-world datasets,confirming the effectiveness and efficiency of the KS-UE and KS-GRR strategies.

关 键 词:KEY-VALUE local differential privacy frequency estimation mean estimation data perturbation 

分 类 号:TP31[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象