检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:何云斌[1] 董恒 万静[1] 李松[1] HE Yunbin;DONG Heng;WAN Jing;LI Song(College of Computer Science and Technology,Harbin University of Science and Technology,Harbin 150080,China)
机构地区:[1]哈尔滨理工大学计算机科学与技术学院,哈尔滨150080
出 处:《计算机科学与探索》2020年第4期554-565,共12页Journal of Frontiers of Computer Science and Technology
基 金:国家自然科学基金No.61872105;黑龙江省自然科学基金No.F201302;黑龙江省教育厅科学技术研究项目No.12531z004。
摘 要:针对密度峰值算法在选取聚类中心时的时间复杂度过高,需要人工选择截断距离并且处理流形数据时有可能出现多个密度峰值,导致聚类准确率下降等问题,提出一种新的密度峰值聚类算法,从聚类中心选择、离群点筛选、数据点分配三方面进行讨论和分析,并给出相应的聚类算法。在聚类中心的选择上采取KNN的思想计算数据点的密度,离群点的筛选和剪枝以及数据点分配则利用Voronoi图的性质,结合数据点的分布特征进行处理,并在最后应用层次聚类的思想以合并相似类簇,提高聚类准确率。实验结果表明:所提算法与实验对比算法相比较,具有较好的聚类效果和准确性。The time complexity of density peak algorithm in selecting the cluster center is very high.It needs to manually select the cutoff distance.When processing the manifold data,there may be multiple density peaks,which leads to the decrease of clustering accuracy.In this paper,a new density peak clustering algorithm is proposed.This paper discusses and analyzes the clustering algorithm from three aspects of clustering center selection,outlier filtering and data point allocation.The clustering algorithm uses the KNN idea to calculate the density of data points in the selection of the cluster center.The screening and pruning of the outliers and the data point allocation are processed by the properties of the Voronoi diagram combined with the distribution characteristics of the data points.Finally,the hierarchical clustering idea is applied to merge similar clusters to improve clustering accuracy.The experimental results show that compared with the experimental comparison algorithms,the proposed algorithm has better clustering effect and accuracy.
分 类 号:TP311.13[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.116.237.222