检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]大庆石油学院计算机与信息技术学院,黑龙江大庆163318
出 处:《计算机应用》2007年第11期2860-2862,共3页journal of Computer Applications
基 金:黑龙江省研究生创新科研资金项目(YJSCX2006-38HLJ)
摘 要:目前性能较好的多分类算法有1-v-r支持向量机(SVM)、1-1-1SVM、DDAG SVM等,但存在大量不可分区域且训练时间较长的问题。提出一种基于二叉树的多分类SVM算法用于电子邮件的分类与过滤,通过构建二叉树将多分类转化为二值分类,算法采用先聚类再分类的思想,计算测试样本与子类中心的最大相似度和子类间的分离度,以构造决策节点的最优分类超平面。对于C类分类只需C-1个决策函数,从而可节省训练时间。实验表明,该算法得到了较高的查全率、查准率。Now some preferable performance multiclass algorithms, such as 1-v-r support vector machine (SVM) , 1-1-1 SVM and DDAG SVM, have many problems of impartible regions and longer training time. A new multiclass SVM algorithm based on binary tree was introduced on E-mail filtering. It could convert muhiclass problem to binary classification by constructing binary tree. The idea of clustering first and classification later was adopted, and the largest similarity between testing sample and sub-category center and the separation measure of sub-categories were calculated, in oder to construct the optimal class hyperplane of decision-making nodes. Only C - 1 optimal functions were needed for C kinds of classification, so training time could be saved, The experiment results show that the new algorithm has higher filtering recall and precision.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.249