检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:孙大鹏
机构地区:[1]国家计算机网络应急技术处理协调中心辽宁分中心,辽宁沈阳110035
出 处:《信息网络安全》2015年第7期13-19,共7页Netinfo Security
基 金:国家242信息安全计划[2011A011]
摘 要:垃圾短信的问题日益突显,不仅对人们的正常生活造成了诸多的不良影响,还对公共安全和社会稳定造成了一定程度的危害。因此对垃圾短信准确过滤显得尤其重要。经过研究发现,现有的短信过滤技术存在一些不足:基于黑白名单的过滤技术显得过于简单粗暴,基于内容分析的垃圾短信过滤技术虽然准确度得到很大程度的提高,但在实现上也存在着复杂度过高、易导致信息网络阻塞等不足。针对这一缺点,文章详细调查分析了近年来飞速发展起来的云计算技术,发现其在伸缩性、可靠性、成本等方面具有非常大的优势,尤其是依靠它的高扩展能力可以把计算规模做到无限大,而成本又非常低,可以作为不错的计算平台。在此基础上,文章深入分析正在使用的垃圾短信过滤的实现方案,对各过滤实现方式的原理及其性能做了仔细分析比较。文章分析了现行基于内容过滤器所使用的算法,发现其可以通过云计算的Hadoop开源实现方案中的Map Reduce编程模型来实现。The problem of junk message has become more severe. The flood of junk message has not only greatly disturbed people's life and also endangered public security and social stability. Therefore, the research of accurate and intelligent filter of junk message is of great significance. The research of existing filtration methods indicates that their implement has some shortcoming. The filtration methods based on black and white list are too simple and brutal. Although, the accuracy of content-based filtration has been improved greatly, their complexity of algorithm usually is cause of operator service network jam. The research indicates that the cloud computing technology has a great advantage in scalability, reliability, cost and other aspects. In particular, the scale of computing power can be made of infinite size in low cost relied on its high-expansion of scale. So the cloud computing is a good platform. Based on this foundation, the essay conducted a careful analysis of algorithm principle of content-based filtration and found that almost all the algorithm of content-based filtration currently used is based on Bayes classification theory. After a detailed study and relevant experiment, found that the content-based filter can be implemented by relying on the cloud computing platform and MapReduce programming model.
关 键 词:云计算 垃圾短信过滤 HADOOP MAP REDUCE
分 类 号:TN929.53[电子电信—通信与信息系统]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.133.128.223