Authors: TENG Lei (滕磊), LI Yuan (李苑), LI Zhixing (李智星)[1,2], HU Feng (胡峰)
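Affiliations: [1] College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; [2] Chongqing Key Laboratory of Computing Intelligence (Chongqing University of Posts and Telecommunications), Chongqing 400065, China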
Source: Journal of Computer Applications (《计算机应用》), 2019, No. 11, pp. 3198-3203 (6 pages)
Funding: National Key Research and Development Program of China (2017YFB0802305)
Abstract: Existing cross-social-network user alignment algorithms suffer from poor network embedding quality and from negative sampling methods that cannot guarantee the quality of the negative examples they generate. To address these problems, a cross-social-network user alignment algorithm based on knowledge graph embedding, KGEUA (Knowledge Graph Embedding User Alignment), is proposed. In the embedding stage, a subset of known seed anchor user pairs is used to expand the positive examples, the proposed Near_K negative sampling method generates the negative examples, and a knowledge graph embedding method then embeds the two social networks into a unified low-dimensional vector space. In the alignment stage, the existing user similarity measure is improved: the proposed structural similarity is combined with the traditional cosine similarity to measure user similarity jointly, and a greedy matching method based on an adaptive threshold is proposed to align users. Finally, the newly aligned user pairs are added to the training set to continuously optimize the vector space. Experimental results show that the proposed algorithm reaches a hits@30 value of 67.7% on the Twitter-Foursquare dataset, which is 3.3 to 34.8 percentage points higher than the results of the best existing user alignment algorithms, a significant improvement in user alignment performance.
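The alignment stage described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the structural similarity, the adaptive threshold rule, or the Near_K sampler, so this sketch assumes plain cosine similarity and a fixed threshold; the names `cosine`, `greedy_align`, and `threshold` are hypothetical and are not taken from the paper.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def greedy_align(emb_a, emb_b, threshold=0.9):
    """Greedily pair users across two networks, highest similarity first.

    emb_a, emb_b: dicts mapping user id -> embedding vector in a shared space.
    threshold: fixed cut-off (an assumption; the paper uses an adaptive one).
    """
    # Score every cross-network candidate pair, then sort best-first.
    scored = sorted(
        ((cosine(va, vb), ua, ub)
         for ua, va in emb_a.items()
         for ub, vb in emb_b.items()),
        key=lambda item: item[0],
        reverse=True,
    )
    used_a, used_b, pairs = set(), set(), []
    for score, ua, ub in scored:
        if score < threshold:
            break  # all remaining candidates score lower still
        if ua in used_a or ub in used_b:
            continue  # each user is matched at most once
        used_a.add(ua)
        used_b.add(ub)
        pairs.append((ua, ub))
    return pairs
```

In an iterative scheme like the one the abstract outlines, the pairs returned here would be appended to the training anchors before the embedding is retrained; that loop is omitted because its details are not given in this record.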
Keywords: user alignment; social network; network embedding; negative sampling; similarity measurement
Classification code: TP182 [Automation and Computer Technology: Control Theory and Control Engineering]