Authors: 蔡利忠 (CAI Li-zhong); 蔡晓晨 (CAI Xiao-chen) (Xilingol Electric Power Bureau Substation Management Office, Xilinhot 026000, China)
Affiliation: [1] Substation Management Office, Xilingol Electric Power Bureau, Xilinhot 026000, Inner Mongolia
Source: Computer Engineering and Design (《计算机工程与设计》), 2018, No. 9, pp. 2974-2978, 2991 (6 pages)
Abstract: To improve the performance of Chinese text classification, a Chinese text classification model based on deep belief networks (DBN) is proposed. The TF-IDF and LSI features of the text are each taken as input, and the strong feature-learning ability of deep belief networks is used to obtain deep-level features and improve the final classification performance. Experimental results show that LSI features are better suited as input to the DBN text classification model and that, compared with shallow models such as SVM, deep belief networks are more effective for Chinese text classification. With reasonable training and parameter settings, the deep belief network achieves better classification results than the SVM model, improving classification accuracy by 3.4%.
Keywords: text classification; deep belief network; text features; LSI features; restricted Boltzmann machine
Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]