基于模糊近似度的隐私敏感数据过滤算法  被引量:2

Privacy-sensitive data filtering algorithm based on fuzzy approximation

在线阅读下载全文

作  者:方朝剑 胡新荣[1] FANG Chao-jian;HU Xin-rong(School of Computer Science and Artificial Intelligence,Wuhan 430073,China)

机构地区:[1]武汉纺织大学计算机与人工智能学院,武汉430073

出  处:《吉林大学学报(工学版)》2023年第4期1174-1180,共7页Journal of Jilin University:Engineering and Technology Edition

基  金:国家自然科学基金项目(61807013);湖北省高等学校优秀中青年科技创新团队计划项目(T201807);湖北省教育厅科学研究计划重点项目(D20191708)。

摘  要:针对目前现有算法对隐私敏感数据进行过滤时,仅使用单一的近似度获取方法,在求取近似度时存在一定的局限性,导致平均绝对误差(MAE)值和均方根误差(RMSE)值高的问题,提出了一种基于模糊近似度的隐私敏感数据过滤算法。首先通过改进的局部敏感哈希算法E2LSH对数据进行降维处理,获取到更利于后续近似度计算的低维数据,然后采用Paillier同态加密算法在保证数据安全性的前提下对隐私敏感数据信息进行提取,最后构建梯形模糊评分模型,通过修正余弦相似性和皮尔森相关相似性的混合模型相似性算法对模糊近似度进行计算,完成对隐私敏感数据的过滤。分析实验结果可知,本文方法的MAE最低值低于0.82,说明该方法能够有效地降低MAE值和RMSE值,提升数据过滤效果。When the current algorithm is used to filter privacy-sensitive data,only a single approximation acquisition method is used.There are certain limitations in obtaining the approximation,which leads to the problem of high MAE and RMSE values.A privacy-sensitive data filtering algorithm based on fuzzy approximation is proposed.First,the data is reduced in dimensionality through the improved local sensitive hash algorithm E2LSH,and low-dimensional data that is more conducive to the subsequent approximation calculation is obtained,and then the Paillier homomorphic encryption algorithm is used to protect the privacy-sensitive data under the premise of ensuring data security.After extraction,the trapezoidal fuzzy scoring model is finally constructed,and the fuzzy approximation is calculated by the mixed model similarity algorithm of modified cosine similarity and Pearson correlation similarity to complete the filtering of privacy-sensitive data.Analysis of the experimental results shows that the minimum MAE value of the proposed method is lower than 0.82,indicating that the method can effectively reduce the MAE value and RMSE value and improve the data filtering effect.

关 键 词:模糊近似度 隐私敏感数据 数据过滤 Paillier同态加密算法 混合模型相似性算法 余弦相似性 皮尔森相关相似性 梯形模糊评分模型 

分 类 号:TP391.3[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象