检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]浙江师范大学数理与信息工程学院,浙江金华321004
出 处:《山东大学学报(工学版)》2014年第6期15-18,69,共5页Journal of Shandong University(Engineering Science)
基 金:浙江省教育厅科研资助项目(Y201328291);浙江省语委十二五科研规划资助项目(ZY2011C77);国家自然科学基金资助项目(61272007)
摘 要:针对情感分类中采用单一特征分类精度不高的问题,提出多特征加权的分类算法:根据扩展的情感词典计算每个词的情感倾向度,经CHI特征选择后,根据情感词的极性强度调整贝叶斯分类模型中该词的正负后验概率,在原值的基础上加上极性强度影响值。实验将该方法和其他3种单特征选择方法在酒店、影视等语料上的分类精度进行了对比,分类精度得到提升。实验结果表明,将词语的情感倾向度的特征融入到分类器中方法,在有效提高情感倾向性分类精度的同时降低了特征维数。In the traditional classification method,only one feature was considered,that was not good enough for the precision.In order to improve the precision,a classification method based on integrated features was provided.First, the emotional tendency value of one word was calculated according to an extended sentiment dictionary;then after the CHI selection,the weights of the positive and negative emotion word posterior probability in the Bayesian model were adjusted acrodding to its tendency value.In the experiments,four kinds of corpus such as hotel and movie reviews were used,compared with other three methods,the integrated features method was better.The results showed the precision of classification was improved and the dimension of the feature was reduced.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.72