Reverse-Nearest-Neighbor-Based Clustering by Fast Search and Find of Density Peaks  Cited by: 2


Authors: ZHANG Chunhao, XIE Bin, ZHANG Yiran

Affiliations: [1] College of Computer and Cyber Security, Hebei Normal University, Shijiazhuang 050024, China; [2] Hebei Provincial Engineering Research Center for Supply Chain Big Data Analytics and Data Security, Hebei Normal University, Shijiazhuang 050024, China; [3] Hebei Provincial Key Laboratory of Network and Information Security, Hebei Normal University, Shijiazhuang 050024, China

Source: Chinese Journal of Electronics (电子学报(英文版)), 2023, No. 6, pp. 1341-1354 (14 pages)

Funding: Supported by the National Natural Science Foundation of China (62076088) and the Technological Innovation Foundation of Hebei Normal University (L2020K09).

Abstract: Clustering by fast search and find of density peaks (CFSFDP) is built on a novel idea, is easy to implement, and clusters efficiently. It has been widely recognized in various fields since it was proposed in Science in 2014. The CFSFDP algorithm also has limitations: its sample density metrics, defined by a cutoff distance, are not unified; its unstable assignment strategy triggers a domino effect when the remaining samples are assigned; and it can pick wrong density peaks as cluster centers. We propose reverse-nearest-neighbor-based clustering by fast search and find of density peaks (RNN-CFSFDP) to avoid these shortcomings. We redesign and unify the sample density metric by introducing the reverse nearest neighbor. The newly defined local density metric is combined with the K-nearest neighbors of each sample to make the assignment process more robust and to alleviate the domino effect. We also propose a cluster fusion algorithm, which further alleviates the domino effect and effectively avoids picking wrong density peaks as cluster centers. Experimental results on publicly available synthetic and real-world data sets show that, in most cases, the proposed algorithm is superior or at least equivalent to the comparative methods in clustering performance. The proposed algorithm works better on manifold data sets and on data sets of uneven density.
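The abstract names its building blocks but does not state the RNN-based density formula, so the following is only a minimal sketch of two ingredients it mentions: the cutoff-distance density and nearest-higher-density distance of the original CFSFDP, and a plain reverse k-nearest-neighbor count. All function names are hypothetical, the tie-breaking by sort order is an assumption, and the reverse-neighbor count is shown as a generic quantity, not as the density metric defined in the paper.

```python
import numpy as np

def pairwise_distances(X):
    """Euclidean distance matrix for the rows of X (n x d)."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def cutoff_density(D, dc):
    """Original CFSFDP density: number of other samples closer than dc."""
    return (D < dc).sum(axis=1) - 1  # subtract 1 to exclude the sample itself

def delta_and_parent(D, rho):
    """Distance to, and index of, the nearest sample ranked denser.

    Assumption: ties in rho are broken by stable sort order.
    The densest sample gets its maximum distance and parent -1.
    """
    n = len(rho)
    order = np.argsort(-rho, kind="stable")
    delta = np.zeros(n)
    parent = np.full(n, -1)
    for rank, i in enumerate(order):
        if rank == 0:
            delta[i] = D[i].max()
            continue
        higher = order[:rank]
        j = higher[np.argmin(D[i, higher])]
        delta[i] = D[i, j]
        parent[i] = j
    return delta, parent

def reverse_knn_counts(D, k):
    """For each sample, count how many other samples list it among their k nearest neighbors."""
    D = D.copy()
    np.fill_diagonal(D, np.inf)  # a sample is not its own neighbor
    knn = np.argsort(D, axis=1, kind="stable")[:, :k]
    return np.bincount(knn.ravel(), minlength=D.shape[0])
```

In the original CFSFDP, samples with both large density and large delta are candidate cluster centers, and each remaining sample follows its `parent` link. That chain of links is what lets one wrong assignment propagate, which is the domino effect the abstract refers to.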

Keywords: Density peaks; Reverse nearest neighbor; Clustering; Cluster fusion

Classification code: TP311.13 [Automation and Computer Technology: Computer Software and Theory]

 
