Eigenvector Selection Algorithm for Spectral Clustering Based on Dynamic Selective Ensemble (谱聚类中选取特征向量的动态选择性集成方法)

Cited by: 5


Authors: 王兴良 (Wang Xingliang)[1], 王立宏 (Wang Lihong)[1], 武栓虎 (Wu Shuanhu)[1]

Affiliation: [1] School of Computer Science, Yantai University, Yantai 264005

Source: Pattern Recognition and Artificial Intelligence (《模式识别与人工智能》), 2014, No. 5, pp. 452-462 (11 pages)

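Funding: Supported by the National Natural Science Foundation of China (No. 61170224), the Natural Science Foundation of Shandong Province (No. ZR2012FL07), and the Youth Foundation of Yantai University (No. JS11Z8)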

Abstract: In spectral clustering, the eigenvectors corresponding to the k largest eigenvalues do not always yield the best clustering result. This paper therefore applies a selective ensemble of eigenvector groups to improve spectral clustering performance, which raises two problems: how to select the base eigenvector groups and which selective ensemble strategy to use. First, eigenvectors are scored with a constraint score computed from the pairwise constraint information of the training data, and the better-scoring base eigenvector groups are retained. Second, for each test sample, the clustering performance of its l nearest neighbors in the training data is used to evaluate every eigenvector group dynamically, and a small number of groups are selected to vote. Finally, spectral clustering is run on the test dataset using each selected eigenvector group, the resulting clusterings are aligned, and the final clustering result is produced. Experiments on UCI benchmark datasets show that the dynamic selective ensemble method improves the clustering performance on test data.
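The abstract names three building blocks: scoring eigenvectors against pairwise constraints, aligning cluster labels across eigenvector groups, and voting. The following is a minimal sketch of those pieces and not the authors' implementation. It assumes the constraint score takes the common ratio form (squared differences over must-link pairs divided by squared differences over cannot-link pairs, lower is better); the abstract does not give the exact formula. The function names `constraint_score`, `align_labels`, and `majority_vote` are hypothetical, alignment is done by brute force over label permutations (practical only for small k), and the l-nearest-neighbor dynamic selection step is omitted.

```python
from itertools import permutations

import numpy as np


def constraint_score(vec, must_link, cannot_link, eps=1e-12):
    """Assumed ratio-form constraint score for one eigenvector (lower is better).

    Must-link pairs should be close along `vec`; cannot-link pairs should be far apart.
    """
    ml = sum((vec[i] - vec[j]) ** 2 for i, j in must_link)
    cl = sum((vec[i] - vec[j]) ** 2 for i, j in cannot_link)
    return ml / (cl + eps)  # eps avoids division by zero


def align_labels(reference, labels, k):
    """Relabel `labels` with the permutation of 0..k-1 that best agrees with `reference`."""
    reference = np.asarray(reference)
    labels = np.asarray(labels)
    best = max(
        permutations(range(k)),
        key=lambda p: int(np.sum(np.asarray(p)[labels] == reference)),
    )
    return np.asarray(best)[labels]


def majority_vote(aligned, k):
    """Per-sample majority vote over aligned label vectors (ties go to the lowest label)."""
    aligned = np.asarray(aligned)  # shape: (n_groups, n_samples)
    counts = np.stack([(aligned == c).sum(axis=0) for c in range(k)])
    return counts.argmax(axis=0)


if __name__ == "__main__":
    # Toy data: the first vector separates {0, 1} from {2, 3}; the second mixes them.
    vecs = np.array([[0.0, 0.1, 1.0, 1.1],
                     [0.0, 1.0, 0.1, 1.1]])
    must_link = [(0, 1), (2, 3)]
    cannot_link = [(0, 2), (1, 3)]
    scores = [constraint_score(v, must_link, cannot_link) for v in vecs]
    print("best eigenvector index:", int(np.argmin(scores)))

    # Two clusterings that agree up to a label swap, plus one that differs on sample 1.
    reference = [0, 0, 1, 1]
    runs = [[1, 1, 0, 0], [0, 0, 1, 1], [1, 0, 0, 0]]
    aligned = [align_labels(reference, r, k=2) for r in runs]
    print("voted labels:", majority_vote(aligned, k=2).tolist())
```

In a fuller pipeline, each label vector passed to `align_labels` would come from clustering the test data in the subspace of one selected eigenvector group, and the reference could be the clustering from the best-scoring group.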

Keywords: Spectral Clustering; Selective Ensemble; Eigenvector Selection; Constraint Score; l-Nearest Neighbor

Classification number: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]

 
