检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中国计量学院信息工程学院,浙江杭州310018 [2]杭州市质量技术监督检测院,浙江杭州310019
出 处:《中国计量学院学报》2015年第3期285-290,共6页Journal of China Jiliang University
基 金:国家自然科学基金重大专项项目(No.61027005)
摘 要:电子商务产品的评论信息对于电子商务产品质量舆情监测具有极大的参考价值.针对集成学习算法在高维度下分类精度降低的不足之处,提出了一种IG-RS-SVM(Information Gain-Random Subspace-Support Vector Machine)算法.以Random Subspace集成学习算法为基础,以支持向量机算法为基学习器.引入了信息增益特征选择算法.通过对特征空间中每个特征的信息增益值进行排序,剔除无价值的特征,降低RS集成算法生成的特征子空间的维度,从而提高了SVM分类算法的效率.实验结果表明,改进后算法可以有效提高评论内容的分类精度.The remarks of E-commerce products have a great reference value for the public opinion monitoring of E-commerce product quality. Aiming at the deficiency of reducing the classification accuracy with ensemble learning for high-dimensional datasets, a new algorithm, IG-RS-SVM, was proposed. It was based on Random Subspaee, taking SVM as a base learner, and applying the information gain algorithm. By sorting the information gain value of each feature in the feature space, excluding worthless features, and reducing the dimension of feature subspace generated by the Random Subspace algorithm, the efficiency of the SVM classification algorithm was increased. The experimental result shows that the improved algorithm can effectively improve the classification accuracy of remarks.
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.46