Cross-Social Network User Alignment Algorithm Based on Knowledge Graph Embedding  (Cited by: 2)


Authors: TENG Lei; LI Yuan; LI Zhixing [1,2]; HU Feng

Affiliations: [1] College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; [2] Chongqing Key Laboratory of Computing Intelligence (Chongqing University of Posts and Telecommunications), Chongqing 400065, China

Source: Journal of Computer Applications (《计算机应用》), 2019, No. 11, pp. 3198-3203 (6 pages)

Funding: National Key Research and Development Program of China (2017YFB0802305)

Abstract: Existing cross-social network user alignment algorithms suffer from poor network embedding quality and from negative sampling methods that cannot guarantee the quality of the negative examples they generate. To address these problems, a cross-social network user alignment algorithm based on knowledge graph embedding, KGEUA (Knowledge Graph Embedding User Alignment), is proposed. In the embedding stage, a set of known seed anchor user pairs is used to expand the positive examples, a Near_K negative sampling method is proposed to generate negative examples, and a knowledge graph embedding method then embeds the two social networks into a unified low-dimensional vector space. In the alignment stage, the existing user similarity measure is improved: the proposed structural similarity is combined with the traditional cosine similarity to measure user similarity jointly, and a greedy matching method based on an adaptive threshold is proposed to align users. Finally, the newly aligned user pairs are added to the training set so that the vector space is optimized continuously. Experimental results show that the proposed algorithm reaches a hits@30 value of 67.7% on the Twitter-Foursquare dataset, which is 3.3 to 34.8 percentage points higher than the results of the best existing user alignment algorithms, a significant improvement in user alignment performance.
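The abstract names the hits@k metric and a threshold-based greedy matching step but gives no formulas, so the following Python sketch only illustrates the general ideas under stated assumptions. It scores user pairs with plain cosine similarity (the proposed structural similarity is not defined in the abstract and is omitted), and it uses a fixed `threshold` argument as a stand-in for the adaptive threshold, whose update rule is also not given. The function names `cosine`, `hits_at_k`, and `greedy_match`, and the toy embeddings, are hypothetical and not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def hits_at_k(src_emb, tgt_emb, truth, k):
    """Fraction of source users whose true counterpart is among the k most similar targets."""
    hits = 0
    for s, t_true in truth.items():
        ranked = sorted(
            tgt_emb, key=lambda t: cosine(src_emb[s], tgt_emb[t]), reverse=True
        )
        if t_true in ranked[:k]:
            hits += 1
    return hits / len(truth)


def greedy_match(src_emb, tgt_emb, threshold):
    """Greedy one-to-one matching: accept the highest-scoring pairs first.

    Assumption: `threshold` is a fixed cutoff; the paper's adaptive rule is not
    described in the abstract.
    """
    scored = sorted(
        ((cosine(u, v), s, t) for s, u in src_emb.items() for t, v in tgt_emb.items()),
        reverse=True,
    )
    used_src, used_tgt, pairs = set(), set(), []
    for score, s, t in scored:
        if score < threshold:
            break  # all remaining pairs score lower still
        if s not in used_src and t not in used_tgt:
            pairs.append((s, t))
            used_src.add(s)
            used_tgt.add(t)
    return pairs


# Toy embeddings (made up for illustration): two networks in a shared 2-D space.
src = {"a1": [1.0, 0.0], "a2": [0.0, 1.0], "a3": [1.0, 1.0]}
tgt = {"b1": [0.9, 0.1], "b2": [0.1, 0.9], "b3": [-1.0, 0.2]}
truth = {"a1": "b1", "a2": "b2"}

print(hits_at_k(src, tgt, truth, k=1))
print(sorted(greedy_match(src, tgt, threshold=0.9)))
```

In this toy setting, user `a3` stays unmatched because none of its candidate pairs clears the cutoff. In an iterative scheme like the one the abstract describes, accepted pairs would then be added to the training set before the embedding is retrained.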

Keywords: user alignment; social network; network embedding; negative sampling; similarity measurement

Classification: TP182 [Automation and Computer Technology: Control Theory and Control Engineering]

 
