检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:吕妹园 张永健[1] 张永强[1] 孙胜娟[1] LYU Meiyuan;ZHANG Yongjian;ZHANG Yongqiang;SUN Shengjuan(School of Information and Electrical Engineering,Hebei University of Engineering,Handan,Hebei 056107,China)
机构地区:[1]河北工程大学信息与电气工程学院,河北邯郸056107
出 处:《河北科技大学学报》2021年第4期380-388,共9页Journal of Hebei University of Science and Technology
基 金:河北省创新能力提升计划项目(19456003D)。
摘 要:为了解决分布式词表示方法因忽略词语情感信息导致情感分类准确率较低的问题,提出了一种融入情感信息加权词向量的情感分析改进方法。依据专属领域情感词典构建方法,结合词典和语义规则,将情感信息融入到TF-IDF算法中,利用Word2vec模型得到加权词向量表示方法,并运用此方法对采集到的河北省旅游景点的评论文本与对照组进行对比实验。结果表明,与基于分布式词向量表示的情感分析方法相比,采用融入情感信息加权词向量的改进方法进行情感分析,积极文本的准确率提高了6.1%,召回率提高了6.6%,F值达到了90.3%;消极评论文本的准确率提高了6.0%,召回率提高了7.2%,F值达到了89.6%。因此,融入情感信息加权词向量的情感分析改进方法可以有效提高评论文本情感分析的准确率,为用户获得更为准确的评论观点提供参考。In order to solve the problem of low accuracy of sentiment classification caused by neglecting the sentiment information of words in distributed word representation method,an improved sentiment analysis method incorporating weighted word vectors of sentiment information was proposed.According to the exclusive domain sentiment dictionary,combined with the dictionary and semantic rules,the sentiment information is integrated into the TF-IDF algorithm,and the weighted word vector representation method is obtained by using word2vec model.The method is used to compare the collected comments of tourist attractions in Hebei Province with the control group.The results show that compared with the sentiment analysis method based on distributed word vector representation,the accuracy and recall rate of positive text are increased by 6.1%and 6.6%,and the F value reached 90.3%,the accuracy and recall rate of negative text are increased by 6.0%and 7.2%,and the F value reached 89.6% by using the improved method of sentiment analysis integrated with sentiment information weighted word vector.Therefore,the improved method of sentiment analysis integrated with sentiment information weighted word vector can effectively improve the accuracy of sentiment analysis of comment text,and provide valuable reference for users to obtain more accurate comments.
关 键 词:自然语言处理 语义规则 情感信息 TF-IDF Word2vec 加权词向量 情感分析
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.112