基于敏感词分析的高校舆情监控系统设计与实现  

The Design of a Public Opinion Monitoring System at Universities and its Implementation Based on the Analysis of Sensitive Words

在线阅读下载全文

作  者:朱金山[1] 

机构地区:[1]浙江大学宁波理工学院,浙江宁波315100

出  处:《集宁师范学院学报》2017年第6期37-41,共5页Journal of Jining Normal University

基  金:浙江省教育科学规划课题"基于有线;无线一体化网络构建高校舆情分析系统"(课题编号:2017SCG228)

摘  要:网络敏感词分析是舆情监控系统的关键,该文介绍了Spark、Flume、kafka等用于系统架构的主要开源组件,分析了敏感词分析中主要用到的Han LP中文分词和命名实体识别两大组件,以及利用Word2vec训练词向量组件进行相似度判断的算法原理及时间复杂度比较,根据高校网络用户流量特征,提出了舆情监控的系统架构设计,最后展示了系统原型实现,并对其进行了探讨及前景展望。The analysis of the sensitive words on the network at universities is the key to the public opinion monitoringsystem. Based on the open components like Spark, Flume and Kafka applied to systematic constructions, this paper analyzesthe two dominantly-used components concerning the analysis of sensitive words-the components of Chineseword-segmentation and naming-identification from the system of HanLP. Also, arithmetic principles and temporalcomplexity for similarity judgement are suggested by Word2vec. Thus, it is proposed that the public opinion monitoringshould be systematically designed and implemented based on users’ flow characteristics at universities. Finally, a case of theoriginal system is demonstrated concerning its discussion and outlook.

关 键 词:敏感词分析 SPARK FLUME Kafka HanLP Word2vec 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象