基于MapReduce的网络舆情分析系统的设计与实现  被引量:2

Design and implementation of public opinion analysis system for network based on MapReduce

在线阅读下载全文

作  者:黄蔚[1] 李戴维[1] 

机构地区:[1]华北计算技术研究所信息技术应用系统部,北京100083

出  处:《信息技术》2014年第7期149-153,共5页Information Technology

摘  要:设计并实现了一个基于MapReduce的网络舆情分析系统。系统采用HDFS和HBase双存储机制存储数据。通过实验分析与效果比对,选用MMSeg4j为系统进行中文分词。改进了Canopy-Kmeans算法实现文本自动聚类,提高了系统的聚类准确度及效率。目前,该系统已应用于某部队舆情分析系统中,能够实时发现热点话题、准确把握舆情趋势,为应对舆论危机、制定舆论政策提供了科学系统的信息支持。This paper designed a network public opinion analysis system based on MapReduce. Dual storage mechanism composed of HDFS and HBase used for storing data. MMSeg4j was selected for Chinese word segmentation by comparing the experimental results and word segmentation efficiency. In order to improve the accuracy and efficiency of the clusters, the Canopy-Kmeans algorithm was improved. Currently, the system has been applied to a public opinion analysis system in an army, the system can detect hot topics in real time and grasp the trend of public opinion accurately. It offered scientific and systematic support for dealing with public opinion crises and formulating public policy.

关 键 词:HADOOP 舆情分析 MAPREDUCE 中文分词 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象