Source: Computer Technology and Development (《计算机技术与发展》), 2008, No. 5, pp. 79-81, 85 (4 pages)
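Funding: National Natural Science Foundation of China (60675031; 60475017); Key Natural Science Research Project of the Anhui Provincial Department of Education (2006KJ015A); Natural Science Research Project of the Anhui Provincial Department of Education (2005kj053); Anhui University 211 Project Academic Innovation Team; 973 Program (National Key Basic Research Program) (2004CB318108)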
Abstract: Texts are classified automatically by combining CHI-value feature selection with the covering algorithm for feedforward neural networks, after the texts have been preprocessed by word segmentation. CHI values are used to select features, that is, to reduce the feature dimensionality, and the covering algorithm is then applied to classify the texts. Combining the two improves classification speed while preserving classification accuracy. The method was tested on texts from a standard data set, and its results at different feature dimensions were compared with those of SVM and naive Bayes. The results show that the covering algorithm achieves higher accuracy than SVM and naive Bayes, and that the choice of dimension strongly affects classification accuracy.
Classification number: TP301.6 [Automation and Computer Technology: Computer System Architecture]
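The feature-selection step named in the abstract can be illustrated with a minimal sketch. It uses the standard chi-square (CHI) statistic for a term and a category, computed from document counts. The abstract does not say how per-category scores are combined or how many features are kept, so taking the maximum over categories and the parameter `k` are assumptions, and the function names `chi_square` and `select_features` are hypothetical. The covering-algorithm classifier itself is not shown.

```python
from collections import Counter


def chi_square(a, b, c, d):
    """CHI statistic of a term for one category.

    a: documents in the category that contain the term
    b: documents outside the category that contain the term
    c: documents in the category that lack the term
    d: documents outside the category that lack the term
    """
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        return 0.0
    return n * (a * d - c * b) ** 2 / denom


def select_features(docs, labels, k):
    """Return the k terms with the highest CHI score.

    docs: list of token lists (already word-segmented)
    labels: category label of each document
    Assumption: a term's score is its maximum CHI over all categories.
    """
    n = len(docs)
    doc_sets = [set(tokens) for tokens in docs]
    vocab = set().union(*doc_sets)
    class_sizes = Counter(labels)

    scores = {}
    for term in vocab:
        # Labels of the documents that contain this term.
        containing = [lab for s, lab in zip(doc_sets, labels) if term in s]
        df = len(containing)
        per_class = Counter(containing)
        best = 0.0
        for cat, size in class_sizes.items():
            a = per_class[cat]
            b = df - a
            c = size - a
            d = n - size - b
            best = max(best, chi_square(a, b, c, d))
        scores[term] = best

    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    return ranked[:k]


if __name__ == "__main__":
    # Toy corpus, invented purely for illustration.
    docs = [
        ["ball", "team", "win"],
        ["ball", "match"],
        ["stock", "market", "win"],
        ["stock", "price"],
    ]
    labels = ["sports", "sports", "finance", "finance"]
    print(select_features(docs, labels, 2))  # ['ball', 'stock']
```

In this toy corpus "ball" and "stock" each occur in exactly one category, so they score highest, while "win" occurs equally in both categories and scores zero. The reduced vocabulary would then be used to build the document vectors passed to the classifier.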