基于带权评论图的水军群组检测及特征分析  被引量:7

Weighted reviewer graph based spammer group detection and characteristic analysis

在线阅读下载全文

作  者:张琪[1] 纪淑娟[1] 傅强 张纯金[3] ZHANG Qi;JI Shujuan;FU Qiang;ZHANG Chunjin(Shandong Key Laboratory of Wisdom Mine Information Technology (Shandong University of Science and Technology),Qingdao Shangdong 266590,China;Technical Reconnaissance Detachment,Qinhuangdao Public Security Bureau,Qinhuangdao Hebei 066100,China;Network Information Center,Shandong University of Science and Technology,Qingdao Shangdong 266590,China)

机构地区:[1]山东省智慧矿山信息技术重点实验室(山东科技大学),山东青岛266590 [2]秦皇岛市公安局技术侦察支队,河北秦皇岛066000 [3]山东科技大学网络信息中心,山东青岛266590

出  处:《计算机应用》2019年第6期1595-1600,共6页journal of Computer Applications

基  金:国家自然科学基金资助项目(71772107,61502281)~~

摘  要:针对在电子商务平台上检测编写虚假评论的水军群组的问题,提出了基于带权评论图的水军群组检测算法(WGSA)。首先,利用共评论特征构建带权评论图,权重由一系列群组造假指标计算得到;然后,为边权重设置阈值筛选可疑子图;最后,从图的社区结构出发,利用社区发现算法生成最终的水军群组。在Yelp大型数据集上的实验结果表明,与K均值聚类算法(KMeans)、基于密度的噪声应用空间聚类算法(DBscan)以及层次聚类算法相比WGSA算法的准确度更高,同时对检测到水军群组的特征与差异作了分析,发现水军群组的活跃度不同,危害也不同。其中,高活跃度群组危害最大,应重点关注。Concerning the problem that how to detect spammer groups writing fake reviews on the e-commerce platforms, a Weighted reviewer Graph based Spammer group detection Algorithm(WGSA) was proposed. Firstly, a weighted reviewer graph was built based on the co-reviewing feature with the weight calculated by a series of group spam indicators. Then, a threshold was set for the edge weight to filter the suspicious subgraphs. Finally, considering the community structure of the graph, the community discovery algorithm was used to generate the spammer groups. Compared with K-Means clustering algorithm(KMeans), Density-Based spatial clustering of applications with noise(DBscan) and hierarchical clustering algorithm on the large dataset Yelp, the accuracy of WGSA is higher. The characteristics and distinction of the detected spammer groups were also analyzed, which show that spammer groups with different activeness have different harm. The high-active group is more harmful and should be concerned more.

关 键 词:电子商务 水军群组 带权评论图 社区发现 聚类 

分 类 号:TP391.4[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象