Authors: Luo Liang (罗梁)[1], Wang Wenxian (王文贤)[1,2], Zhong Jie (钟杰)[1], Wang Haizhou (王海舟)[1]
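Affiliations: [1] Institute of Network and Trusted Computing, College of Computer Science, Sichuan University, Chengdu, Sichuan 610065; [2] Cybersecurity Research Institute, Sichuan University, Chengdu, Sichuan 610065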
Source: Netinfo Security (《信息网络安全》), 2017, No. 2, pp. 51-58 (8 pages)
Funding: National Key Technology R&D Program of China [2012BAH18B05]; National Natural Science Foundation of China [61272447]
Abstract: With the mass adoption of social networks in recent years, they have come to play an increasingly important role in people's daily lives. These networks have enormous user bases, but only a small proportion of users have completed real-name authentication. This lets malicious users freely spread rumors and harmful information, and poses a major challenge for Internet regulation. Associating real-world users across social networks and building an identity-recognition information network can therefore help with user identification and supervision. The paper designs and implements an information collection system for QZone and Sina Weibo, then analyzes the profile and behavior data of 5.44 million Weibo users and 24.59 million QZone users collected from the network, and proposes an overall model for associating users across social networking sites. The model classifies candidate users with a logistic regression model. In addition, based on the principle of the SimRank algorithm, the paper proposes the SNC algorithm to eliminate noise users and improve the precision of the model. Finally, cross-social-network user association experiments are run on the dataset filtered in the paper. The experimental results show that the model can filter out strongly associated user pairs, that its precision improves after pruning, and that it can effectively associate users across different social networks.
Keywords: cross-social-network; user association; information collection; SNC algorithm; logistic regression model
Classification: TP393.09 [Automation and Computer Technology: Computer Application Technology]
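The abstract says the SNC algorithm builds on the principle of SimRank but does not describe SNC itself. As background only, here is a minimal sketch of the standard SimRank iteration, in which two nodes are similar when their in-neighbors are similar. This is not the paper's SNC algorithm; the function name `simrank`, the decay value 0.8, the iteration count, and the toy graph are all assumptions made for illustration.

```python
from itertools import product


def simrank(in_neighbors, decay=0.8, iterations=10):
    """Plain iterative SimRank (illustrative; not the paper's SNC algorithm).

    in_neighbors maps each node to the list of nodes pointing at it.
    Every node that appears as a neighbor must also be a key.
    """
    nodes = list(in_neighbors)
    # Start from the identity: a node is fully similar only to itself.
    sim = {(a, b): 1.0 if a == b else 0.0
           for a, b in product(nodes, repeat=2)}
    for _ in range(iterations):
        new_sim = {}
        for a, b in product(nodes, repeat=2):
            if a == b:
                new_sim[(a, b)] = 1.0
                continue
            ia, ib = in_neighbors[a], in_neighbors[b]
            if not ia or not ib:
                # Nodes without in-neighbors have similarity 0 to others.
                new_sim[(a, b)] = 0.0
                continue
            # Average similarity over all pairs of in-neighbors, damped.
            total = sum(sim[(i, j)] for i, j in product(ia, ib))
            new_sim[(a, b)] = decay * total / (len(ia) * len(ib))
        sim = new_sim
    return sim


# Hypothetical toy graph: u2 and u3 are both pointed at by u1.
graph = {"u1": [], "u2": ["u1"], "u3": ["u1"], "u4": []}
scores = simrank(graph)
```

In this toy graph, `u2` and `u3` share their only in-neighbor and so receive a similarity equal to the decay factor, while `u4` has no in-neighbors and scores 0 against every other node. A pruning step in the spirit of the abstract could then discard candidate pairs whose score falls below a threshold, though how SNC actually does this is not stated in the abstract.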