Affiliation: [1] School of Information Science and Technology, Southwest Jiaotong University
Source: Application Research of Computers (《计算机应用研究》), 2012, No. 10, pp. 3805-3808 (4 pages)
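Funding: National Natural Science Foundation of China, Joint Fund project (U0970122); Fundamental Research Funds for the Central Universities, Science and Technology Innovation Project (SWJTU09CX040)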
Abstract: This paper studies flow-based spam behavior recognition. Because the communication topology of spam differs markedly from that of legitimate e-mail, the paper introduces the concept of similarity and proposes a spam behavior recognition method based on topological similarity. The method represents each sender and receiver by a contact list, computes the similarity between users, and uses it to divide e-mail users into multiple user groups. An incoming e-mail is then judged to be spam or not by determining which groups its sender and receiver belong to. An auxiliary classifier is used to help classify and group the original e-mail users. Experiments on a real e-mail set show that the classification method based on topological similarity has good classification ability.
Classification number: TP393 [Automation and Computer Technology: Computer Application Technology]
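The pipeline summarized in the abstract (contact lists, user similarity, user groups, group-membership test) can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not state which similarity measure, grouping procedure, or decision rule the paper uses, so the Jaccard index, the greedy threshold grouping, the `threshold` value, and all function names below are assumptions chosen only for illustration. The auxiliary classifier mentioned in the abstract is not modeled.

```python
from collections import defaultdict


def build_contacts(mail_log):
    """Map each address to the set of addresses it exchanged mail with.

    mail_log is an iterable of (sender, recipient) pairs (hypothetical format).
    """
    contacts = defaultdict(set)
    for sender, recipient in mail_log:
        contacts[sender].add(recipient)
        contacts[recipient].add(sender)
    return contacts


def jaccard(a, b):
    """Jaccard index of two sets; an assumed stand-in for the paper's similarity."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cluster_users(contacts, threshold=0.5):
    """Greedy grouping (assumed): a user joins the first group that has a
    member whose contact list is at least `threshold` similar, else starts
    a new group. Each user is counted as part of their own contact list so
    that direct correspondents overlap.
    """
    clusters = []
    for user in sorted(contacts):
        own = contacts[user] | {user}
        for cluster in clusters:
            if any(jaccard(own, contacts[m] | {m}) >= threshold for m in cluster):
                cluster.add(user)
                break
        else:
            clusters.append({user})
    return clusters


def looks_like_spam(sender, recipient, clusters):
    """Assumed decision rule: flag mail whose sender and recipient share no group."""
    return not any(sender in c and recipient in c for c in clusters)


if __name__ == "__main__":
    # Toy traffic: three colleagues mail each other; one address fans out widely.
    log = [
        ("alice", "bob"), ("bob", "carol"), ("carol", "alice"),
        ("spammer", "alice"), ("spammer", "dave"),
        ("spammer", "erin"), ("spammer", "frank"),
    ]
    groups = cluster_users(build_contacts(log))
    print(groups)
    print(looks_like_spam("bob", "alice", groups))
    print(looks_like_spam("spammer", "carol", groups))
```

In this toy setup the three mutually connected users end up in one group, while the fan-out address shares little of its contact list with anyone, which is the kind of topological difference the abstract describes. A real system would need a similarity measure and threshold tuned on labeled traffic.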