E-commerce spammer groups discovery algorithm considering time characteristics  (Cited by: 3)


Authors: Zhang Wenpeng; Ji Shujuan[1]; Li Jinpeng; Zhang Qi[1]

Affiliation: [1] Shandong Provincial Key Laboratory of Wisdom Mine Information Technology, Shandong University of Science & Technology, Qingdao, Shandong 266590, China

Source: Application Research of Computers (《计算机应用研究》), 2021, No. 8, pp. 2321-2327 (7 pages)

Funding: National Natural Science Foundation of China (71772107, 62072288).

Abstract: To address the spammers that are widespread on e-commerce platforms, this paper proposes an algorithm that combines network structure and time characteristics to detect spammer groups in review networks. The algorithm consists of four steps: a) analyzing the structural characteristics of the review network to mine target products that are vulnerable to spammer attacks; b) inspired by the "co-bursting phenomenon", mining the suspicious periods during which a target product is attacked by spammer groups; c) constructing the target product-reviewer induced subgraph from the data within the suspicious periods of the target products, and applying a hierarchical agglomerative clustering algorithm on this subgraph to generate candidate spammer groups; d) purifying the candidate groups to filter out normal users who happened to shop and post reviews during the suspicious periods, and then classifying the purified groups based on the reviewers' behavioral characteristics. Experimental results on real datasets show that the algorithm can accurately and efficiently detect spammer groups active on e-commerce websites.
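Steps b) and c) of the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's actual definitions, so everything concrete here is an assumption made for illustration: the record layout `(reviewer, product, day)`, the sliding-window burst rule (`window`, `min_reviews`), the Jaccard similarity, the average-linkage merge rule, and the `min_sim` stopping threshold. Step a) is replaced by a hand-picked `targets` set, and step d) (purification and classification) is not covered.

```python
from collections import defaultdict
from itertools import combinations


def suspicious_periods(days, window=3, min_reviews=3):
    """Assumed burst rule: any `window`-day span holding >= `min_reviews` reviews.

    Returns merged (first_day, last_day) tuples in ascending order.
    """
    days = sorted(days)
    periods = []
    for i, start in enumerate(days):
        in_window = [d for d in days[i:] if d <= start + window - 1]
        if len(in_window) < min_reviews:
            continue
        lo, hi = start, in_window[-1]
        if periods and lo <= periods[-1][1]:
            # Overlaps the previous burst: extend it instead of adding a new one.
            periods[-1][1] = max(periods[-1][1], hi)
        else:
            periods.append([lo, hi])
    return [tuple(p) for p in periods]


def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def agglomerate(profiles, min_sim=0.5):
    """Bottom-up clustering with average linkage over Jaccard similarity.

    Repeatedly merges the two most similar clusters and stops once the best
    pair falls below `min_sim` (an assumed stopping rule).
    """
    clusters = [{r} for r in sorted(profiles)]

    def linkage(c1, c2):
        sims = [jaccard(profiles[x], profiles[y]) for x in c1 for y in c2]
        return sum(sims) / len(sims)

    while len(clusters) > 1:
        i, j = max(combinations(range(len(clusters)), 2),
                   key=lambda ij: linkage(clusters[ij[0]], clusters[ij[1]]))
        if linkage(clusters[i], clusters[j]) < min_sim:
            break
        clusters[i] |= clusters[j]
        del clusters[j]  # safe: combinations guarantees i < j
    return [c for c in clusters if len(c) > 1]


def candidate_groups(reviews, targets, window=3, min_reviews=3, min_sim=0.5):
    """Build the reviewer side of the product-reviewer subgraph restricted to
    suspicious periods, then cluster reviewers into candidate groups."""
    by_product = defaultdict(list)
    for reviewer, product, day in reviews:
        if product in targets:
            by_product[product].append((reviewer, day))

    # profile[reviewer] = set of (product, period index) edges in the subgraph
    profiles = defaultdict(set)
    for product, rows in by_product.items():
        periods = suspicious_periods([d for _, d in rows], window, min_reviews)
        for reviewer, day in rows:
            for k, (lo, hi) in enumerate(periods):
                if lo <= day <= hi:
                    profiles[reviewer].add((product, k))
    return agglomerate(profiles, min_sim)


# Made-up toy data: u1-u3 review both products inside the same bursts,
# while u4 and u5 review the same products much later.
reviews = [
    ("u1", "p1", 1), ("u2", "p1", 2), ("u3", "p1", 2),
    ("u1", "p2", 10), ("u2", "p2", 11), ("u3", "p2", 11),
    ("u4", "p1", 40), ("u5", "p2", 70),
]
groups = candidate_groups(reviews, targets={"p1", "p2"})
print([sorted(g) for g in groups])  # [['u1', 'u2', 'u3']]
```

In this toy run the three reviewers who co-occur in the bursts on both products end up in one candidate group, while the two late reviewers never enter the subgraph. A real implementation would still need the purification step, since ordinary buyers can fall inside a suspicious period by chance.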

Keywords: e-commerce; spammer groups; suspicious period; hierarchical clustering

Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]

 
