Clustering by fast search and find of density peaks based on spectrum analysis


Authors: HAN Zhonghua [1,2]; BI Kaiyuan; SI Wen; LYU Zhe [1]

Affiliations: [1] Faculty of Information and Control Engineering, Shenyang Jianzhu University, Shenyang, Liaoning 110168, China; [2] Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, Liaoning 110016, China

Source: Journal of Computer Applications (《计算机应用》), 2019, Issue 2, pp. 409-413 (5 pages)

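Funding: National Natural Science Foundation of China (61503259); General Project of the Liaoning Provincial Department of Science and Technology (201602608); Basic Scientific Research Project of Liaoning Higher Education Institutions (LJZ2017015); Liaoning Archives Science and Technology Project (L-2018-X-10)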

Abstract: To address the varying clustering performance of Clustering by Fast Search and Find of Density Peaks (CFSFDP) across different datasets, an improved algorithm that incorporates spectral clustering is proposed, named CFSFDP-SA (CFSFDP based on Spectrum Analysis). First, the high-dimensional nonlinear dataset is mapped into a low-dimensional subspace for dimension reduction, and the clustering problem is recast as an optimal graph partitioning problem, which improves the algorithm's adaptability to the global structure of the data. Then, CFSFDP is applied to cluster the processed dataset. Combining the strengths of the two clustering algorithms further improves clustering performance. Experiments on five synthetic datasets (two linear and three nonlinear) and four real datasets from the UCI repository show that CFSFDP-SA achieves higher clustering accuracy than CFSFDP, with an improvement of up to 14% on high-dimensional datasets, and adapts better to the original datasets.
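The second stage named in the abstract, CFSFDP, can be sketched roughly as follows. This is a minimal pure-Python illustration of the general density-peaks procedure, not the authors' implementation. The function name `density_peaks`, the Gaussian-kernel density, the cutoff parameter `d_c`, and the choice of centers by the largest product of density and distance are all assumptions made for this sketch. The spectral mapping of the first stage is not shown; the input is assumed to be points that have already been embedded in the low-dimensional subspace.

```python
import math


def density_peaks(points, d_c, n_clusters):
    """Illustrative density-peaks clustering (hypothetical helper, not from the paper).

    points     : list of equal-length coordinate tuples
    d_c        : cutoff distance used as the Gaussian kernel width (assumed parameter)
    n_clusters : number of cluster centers to select (assumed to be given)
    """
    n = len(points)
    dist = [[math.dist(p, q) for q in points] for p in points]

    # Local density rho_i: Gaussian-kernel sum over all other points.
    rho = [
        sum(math.exp(-((dist[i][j] / d_c) ** 2)) for j in range(n) if j != i)
        for i in range(n)
    ]

    # Indices sorted by decreasing density (stable for ties).
    order = sorted(range(n), key=lambda i: -rho[i])

    # delta_i: distance to the nearest point that ranks higher in density.
    # parent[i] remembers that neighbor for the later assignment step.
    delta = [0.0] * n
    parent = [-1] * n
    for rank, i in enumerate(order):
        if rank == 0:
            # The densest point gets the largest distance by convention.
            delta[i] = max(dist[i])
            continue
        j = min(order[:rank], key=lambda h: dist[i][h])
        delta[i] = dist[i][j]
        parent[i] = j

    # Centers: points with the largest rho * delta. Sorting `order` keeps the
    # densest point first among ties, so it is always selected as a center.
    centers = sorted(order, key=lambda i: -rho[i] * delta[i])[:n_clusters]

    labels = [-1] * n
    for cluster_id, c in enumerate(centers):
        labels[c] = cluster_id

    # Remaining points inherit the label of their nearest higher-density
    # neighbor, visited in order of decreasing density.
    for i in order:
        if labels[i] == -1:
            labels[i] = labels[parent[i]]
    return labels, centers


if __name__ == "__main__":
    # Two well-separated toy groups standing in for embedded data.
    pts = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
           (10, 10), (10, 11), (11, 10), (11, 11), (10.5, 10.5)]
    labels, centers = density_peaks(pts, d_c=1.0, n_clusters=2)
    print(labels, centers)
```

In this sketch the number of clusters is passed in directly; in practice, centers are often picked by inspecting a plot of density against distance, which is omitted here.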

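Keywords: data clustering; adaptability; dimension reduction; Clustering by Fast Search and Find of Density Peaks (CFSFDP); spectrum analysis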

Classification number: TP301.6 [Automation and Computer Technology: Computer System Architecture]

 
