An incremental support vector machine approach based on probability density distribution  Cited by: 6


Authors: Pan Shichao[1], Wang Wenjian[1,2], Guo Husheng[1]

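Affiliations: [1] School of Computer and Information Technology, Shanxi University, Taiyuan 030006; [2] Key Laboratory of Computational Intelligence and Chinese Information Processing of Ministry of Education, Shanxi University, Taiyuan 030006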

Source: Journal of Nanjing University (Natural Science), 2013, No. 5, pp. 603-610 (8 pages)

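Funding: National Natural Science Foundation of China (60975035; 61273291); Research Funding Project for Returned Overseas Scholars of Shanxi Province (2012-008)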

Abstract: The incremental support vector machine (ISVM) learns by adding one sample or one batch of samples in each cycle, which decomposes a large-scale problem into a series of subproblems and improves the efficiency of the support vector machine (SVM) on large-scale data. In the traditional ISVM (TISVM), however, a poor choice of incremental samples may reduce convergence speed, efficiency, and the eventual generalization ability. To address the selection of incremental samples, an ISVM algorithm based on probability density distribution, called PISVM, is proposed. The method uses the probability density distribution to select incremental samples that carry more important classification information (those likely to become support vectors), so that the classifier converges to the optimal hyperplane as quickly as possible. To verify the approach, experiments compare three methods: PISVM, TISVM, and a minimum distance classifier approach. The results on UCI data sets indicate that PISVM further improves learning efficiency while maintaining its generalization ability.
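The abstract does not state which density estimator or selection rule PISVM uses, so the following is only a minimal sketch of the general idea of density-based incremental sample selection. It assumes a Gaussian kernel density estimate per class and ranks candidates by how close the two class densities are, on the assumption that such samples lie near the decision boundary and are more likely to become support vectors. The names `gaussian_kde`, `select_increment`, `bandwidth`, and `batch_size` are hypothetical and do not come from the paper.

```python
import math


def gaussian_kde(points, bandwidth):
    """Return a function that estimates the density of `points` at x.

    Assumption: an isotropic Gaussian kernel with a fixed bandwidth.
    """
    dim = len(points[0])
    norm = len(points) * (math.sqrt(2 * math.pi) * bandwidth) ** dim

    def density(x):
        total = 0.0
        for p in points:
            sq_dist = sum((a - b) ** 2 for a, b in zip(x, p))
            total += math.exp(-sq_dist / (2 * bandwidth ** 2))
        return total / norm

    return density


def select_increment(candidates, pos, neg, batch_size, bandwidth=1.0):
    """Pick the candidates whose two class densities are most similar.

    Assumption: a small absolute log-density ratio marks a sample near
    the class boundary, hence a likely support vector.
    """
    p_pos = gaussian_kde(pos, bandwidth)
    p_neg = gaussian_kde(neg, bandwidth)
    eps = 1e-300  # guards against log(0) far away from both classes

    def ambiguity(x):
        return abs(math.log(p_pos(x) + eps) - math.log(p_neg(x) + eps))

    return sorted(candidates, key=ambiguity)[:batch_size]


# Two well-separated classes and four unlabeled candidates.
pos = [(2.0, 0.0), (3.0, 0.0), (2.0, 1.0)]
neg = [(-2.0, 0.0), (-3.0, 0.0), (-2.0, -1.0)]
candidates = [(5.0, 0.0), (0.0, 0.0), (-5.0, 0.0), (0.5, 0.0)]
batch = select_increment(candidates, pos, neg, batch_size=2)
```

In an incremental loop, each selected batch would be added to the current training set before the SVM is retrained. The candidates deep inside one class, here `(5.0, 0.0)` and `(-5.0, 0.0)`, are left out of the batch because they are unlikely to change the separating hyperplane.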

Keywords: support vector machine; PISVM model; incremental sample selection; probability density distribution

Classification number: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]

 
