深度置信网络的Spark并行化在微博情感分类中的应用研究  被引量:5

RESEARCH ON SPARK PARALLELIZATION OF DEEP CONFIDENCE NETWORK IN MICROBLOGGING EMOTION CLASSIFICATION

在线阅读下载全文

作  者:张翔[1] 石力 尚勃 董丽丽[1] 

机构地区:[1]西安建筑科技大学信息与控制工程学院,陕西西安710055

出  处:《计算机应用与软件》2018年第2期48-53,共6页Computer Applications and Software

基  金:国家自然科学基金项目(51278400);陕西省自然科学基础研究基金项目(2016JM6031)

摘  要:中文微博情感分析可以发现公众对热点事件的态度掌握网络舆情,因此成为文本挖掘的一个热点研究。采用一种基于Spark并行化的深度置信网络的情感分类方法,该方法利用Word2Vec工具表示微博文本和建立情感词典;使用深度置信网络构建微博情感分类模型;通过Spark集群对深度置信神经网络进行并行化处理。实验结果表明,基于深度置信网络的微博情感分类模型在Spark平台下并行化,训练时间大幅缩短,情感分类的准确率比传统的浅层学习方法高5%。Chinese micro blog sentiment analysis can be found that the public's attitude toward hot events and grasp the network public opinion,thus become a hot research in the text mining. This paper put forwards the parallelization of deep belief networks for Chinese micro blog sentiment analysis by Spark. Firstly,the Word2Vec tool was used to express the microblogging text and the establishment of the emotional dictionary. Then,the microblogging emotion classification model was constructed by using the deep confidence network. Finally,the neural network of the deep confidence neural network was processed by the Spark cluster. The experimental results showed that the microblogging emotion classification model based on deep confidence network was parallelized under the Spark platform,and the training time was shortened,and the accuracy of emotion classification was 5% higher than that of the traditional shallow learning method.

关 键 词:中文微博 情感分析 深度置信 网络Spark并行化 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象