Source: Journal of Jilin University: Science Edition (《吉林大学学报(理学版)》), 2008, No. 6, pp. 1101-1104 (4 pages)
Funding: National Natural Science Foundation of China (Grant No. 10571073)
Abstract: Based on multiple hypothesis testing, we propose a feature-weighted naive Bayesian classification algorithm. The algorithm obtains multiple sets of feature words through feature selection, then assigns each set a weight coefficient according to its multiple hypothesis testing error rate; these weights enter the construction of the classifier. The method has been applied to text classification for the mayor's public access line project, where three feature-weighted naive Bayesian classifiers were constructed to classify complaint texts automatically. Compared with traditional methods, the classifier achieves higher efficiency and accuracy.
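The weighting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the weights are computed from the multiple hypothesis testing error rates, so the weights below are hypothetical placeholders supplied by hand, and the names `train`, `classify`, and `feature_sets` as well as the toy complaint data are assumptions made for illustration. The sketch assumes a multinomial naive Bayes model with Laplace smoothing, where each word's log-likelihood is scaled by the weight of the feature set it belongs to.

```python
import math
from collections import Counter, defaultdict


def train(docs, labels, vocab):
    """Estimate log priors and Laplace-smoothed log likelihoods per class."""
    class_counts = Counter(labels)
    word_counts = defaultdict(Counter)
    for tokens, label in zip(docs, labels):
        # Only words that belong to some feature set are counted.
        word_counts[label].update(t for t in tokens if t in vocab)
    n = len(labels)
    log_prior = {c: math.log(k / n) for c, k in class_counts.items()}
    log_like = {}
    for c in class_counts:
        total = sum(word_counts[c].values()) + len(vocab)
        log_like[c] = {w: math.log((word_counts[c][w] + 1) / total) for w in vocab}
    return log_prior, log_like


def classify(tokens, log_prior, log_like, weights):
    """Return the class maximizing log P(c) + sum of weight(w) * log P(w | c)."""
    scores = {}
    for c in log_prior:
        score = log_prior[c]
        for t in tokens:
            if t in weights:
                score += weights[t] * log_like[c][t]
        scores[c] = score
    return max(scores, key=scores.get)


# Hypothetical feature sets, each paired with a hand-picked weight.
# In the paper the weight would come from a multiple hypothesis testing
# error rate; that mapping is not given in the abstract.
feature_sets = [({"road", "water"}, 1.0), ({"pothole", "leak"}, 0.5)]
weights = {w: wt for words, wt in feature_sets for w in words}

# Toy complaint texts, already tokenized (illustrative data only).
docs = [["road", "pothole"], ["road", "repair"], ["water", "leak"], ["water", "pipe"]]
labels = ["traffic", "traffic", "utilities", "utilities"]

log_prior, log_like = train(docs, labels, set(weights))
predicted = classify(["road", "pothole"], log_prior, log_like, weights)
```

Setting every weight to 1.0 reduces this to ordinary multinomial naive Bayes, which makes the effect of the per-set weights easy to compare against the unweighted baseline.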
Classification: O235 [Science: Operations Research and Cybernetics]