Research on SVM Classification Algorithm Based on MapReduce

Cited by: 1

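Affiliations: [1] School of Educational Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003; [2] School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210003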

Source: Computer Technology and Development, 2015, No. 6, pp. 87-91 (5 pages)

Funding: Natural Science Foundation of Jiangsu Province (BK20130882)

Abstract: In a cloud computing environment, the traditional MapReduce-based SVM classification algorithm trains on a data set by simply merging the support vectors obtained on each child node, so the efficiency and accuracy of the resulting classifier are unsatisfactory. To address this, the paper proposes an improved training algorithm. First, a genetic algorithm is run on each node to find the optimal kernel function and parameters for that node's data subset. Second, the subset is trained with the resulting parameter combination to obtain support vectors. Third, the support vectors from all nodes are merged into a global support vector set. Fourth, each node merges its subset with the global support vectors to form a new training data set. These four steps are repeated until the global support vectors no longer change, at which point the algorithm has converged to the optimal classification model. Finally, experiments on the open-source cloud computing platform Hadoop show that the classification accuracy of the proposed algorithm is clearly higher than that of the traditional classification algorithm.
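The iteration described in the abstract can be illustrated with a minimal single-process sketch. It is not the paper's Hadoop implementation. The names `train_1d_svm` and `train_iteratively` are hypothetical, the genetic-algorithm search for the kernel and parameters is omitted, and per-node training is replaced by a toy one-dimensional hard-margin SVM that assumes every -1 sample lies to the left of every +1 sample.

```python
from typing import Callable, List, Set, Tuple

Sample = Tuple[float, int]  # (feature value, label), label is -1 or +1


def train_1d_svm(samples: Set[Sample]) -> Set[Sample]:
    """Toy stand-in for per-node SVM training (hypothetical helper).

    Assumes 1-D data in which every -1 sample lies left of every +1 sample.
    Under that assumption the hard-margin support vectors are the rightmost
    -1 sample and the leftmost +1 sample.
    """
    negatives = [s for s in samples if s[1] == -1]
    positives = [s for s in samples if s[1] == 1]
    support: Set[Sample] = set()
    if negatives:
        support.add(max(negatives))
    if positives:
        support.add(min(positives))
    return support


def train_iteratively(
    partitions: List[List[Sample]],
    train_fn: Callable[[Set[Sample]], Set[Sample]] = train_1d_svm,
    max_rounds: int = 20,
) -> Set[Sample]:
    """Repeat train/merge rounds until the global support vectors stop changing."""
    global_svs: Set[Sample] = set()
    for _ in range(max_rounds):
        # "Map": each node trains on its own subset merged with the global SVs.
        local_svs = [train_fn(set(part) | global_svs) for part in partitions]
        # "Reduce": merge the per-node support vectors into one global set.
        merged: Set[Sample] = set().union(*local_svs)
        if merged == global_svs:  # global set unchanged, so stop
            break
        global_svs = merged
    return global_svs


if __name__ == "__main__":
    parts = [
        [(-5.0, -1), (-1.0, -1), (2.0, 1), (6.0, 1)],
        [(-3.0, -1), (-0.5, -1), (1.0, 1), (4.0, 1)],
    ]
    print(sorted(train_iteratively(parts)))  # [(-0.5, -1), (1.0, 1)]
```

In this toy run the first round yields four candidate support vectors (two per node). Once each node retrains on its subset plus that global set, both nodes agree on the two samples closest to the boundary, and the loop stops when a further round leaves the set unchanged.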

Keywords: MapReduce; SVM; classification algorithm; genetic algorithm; cloud computing

Classification code: TP301.6 [Automation and Computer Technology: Computer System Architecture]
