检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]广西工学院计算机工程系,广西柳州545006 [2]广西大学计算机与电子信息学院,南宁530004
出 处:《计算机应用研究》2011年第6期2095-2098,共4页Application Research of Computers
基 金:广西工学院自然科学基金资助项目(院科自1074011)
摘 要:针对支持向量机在训练大规模数据集时出现的速度瓶颈问题,提出一种新的减样方法,称为双层减样法。数据减样时,双层减样法从粗、细粒度两个层次削减样本。粗粒度约减时,利用核空间距离聚类法,以簇为单位削减冗余子集;细粒度约减时,以点为单位挑选剩余点集中的支持向量。实验表明,双层减样法能有效地压缩样本数据,同时还能放大数据集的分类特征,提高分类器的分类精度。将此法应用于大规模SVM垃圾标签检测模型的训练集优化上,能明显提高检测模型的训练速度。双层减样法将粒度和层次的概念引入减样法中,在约减时适时改变约减幅度,这比传统减样法更具有优势。In order to improve the low efficiency of large-scale SVM,this paper presented a new samples reduction method,called double-layer reduction method.It reduced data in two levels.The first level was coarse-grained reduction.It deleted the redundant clusters with KDC reduction.The second level was fine-grained reduction.It picked out the support vectors from the clusters remained by SMO.The experiments show that double-layer reduction method gives a higher compression ratio and accuracy.It applied the new method to the large scale SVM social spam detection model.The detection model speeds up obviously.Unlike the traditional reduction method,double-layer reduction method uses the concept of "Granularity" and "Level" into reducing method.It changes the reduced intensity according to the number of redundant points remained which has more advantage in reducing.
关 键 词:FOLKSONOMY 垃圾标签 支持向量机 双层减样法 约减
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.104