检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:刘昱 胡立华[1] LIU Yu;HU Lihua(School of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan 030024)
机构地区:[1]太原科技大学计算机科学与技术学院,太原030024
出 处:《计算机与数字工程》2023年第6期1250-1255,共6页Computer & Digital Engineering
摘 要:密度峰值聚类是一类具有代表性的聚类分析方法,但针对复杂数据集时,其聚类效果较差。论文利用数据对象的近邻信息,提出了一种密度峰值聚类分析算法。该算法首先采用数据对象的K近邻,计算数据对象局部密度,并通过与其K近邻的密度和距离的比值得到邻域密度比,重新定义了DPC密度计算方法,有效地解决了DPC截断距离dc在选择上的随意性;其次利用数据对象之间的相似性度量,结合影响空间、共享K近邻和密度比,给出了一种新的数据对象之间的相似性度量方法;然后利用数据对象的距离和密度相似的影响因素并与相似近邻结合,改进了FKNN-DPC分配策略。最后采用UCI数据集,实验验证了该算法具有良好的聚类簇效果。Density peak clustering is a representative cluster analysis method,but its clustering effect is poor for complex data sets.In this paper,a clustering analysis algorithm of density peak is proposed by using the nearest neighbor information of data ob-jects.Firstly,the local density of the data object is calculated by using the k-nearest neighbor of the data object,and the neighbor-hood density ratio is obtained by the ratio of the density and distance of its k-nearest neighbor.The DPC density calculation method is redefined,and the DPC cutoff distance dc is effectively solved.Secondly,using the similarity measure between data objects,combined with influence space,shared k-nearest neighbor and density ratio,a new similarity measure method between data objects is proposed.Then,using the influence factors of distance and density similarity of the data objects and combined with most similar nearest neighbor,FKNN-DPC allocation strategy is improved.Finally,experiments on UCI datasets show that the algorithm has a good cluster effect.
关 键 词:聚类分析 密度峰值 相似性度量 聚类簇扩展 密度
分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.38