检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:丁雨 张瀚霖 罗荣[1] 孟华[1] DING Yu;ZHANG Hanlin;LUO Rong;MENG Hua(School of Mathematics,Southwest Jiaotong University,Chengdu Sichuan 611756,China)
出 处:《计算机应用》2024年第4期1128-1138,共11页journal of Computer Applications
基 金:中央高校基本科研业务费专项资金资助项目(2682023ZTPY027)。
摘 要:信念峰值聚类(BPC)算法是一种基于模糊视角的密度峰值聚类(DPC)算法的新变体,它用模糊数学的观点刻画数据的分布特征与相关性。但BPC算法的信念值计算主要基于局部数据点信息,未考察数据集整体的分布和结构,且原始的分配策略鲁棒性弱。针对以上问题,提出一种基于信念子簇切割的模糊聚类算法(BSCC),所提算法结合了信念峰值和谱方法。首先,通过局部信念信息将数据集划分为众多高纯度子簇;其次,将子簇视作新样本,通过簇间的相似关系,利用谱方法进行割图聚类,从而耦合局部信息与全局信息;最后,将子簇内的点分配至子簇所在类簇以完成最终聚类。与BPC算法相比,BSCC在带有多子簇结构的数据集上具有明显优势,如在americanflag数据集和Car数据集上的准确率(ACC)分别提高了16.38个百分点和21.35个百分点。在合成数据集和真实数据集上的聚类实验结果表明,BSCC在调整兰德系数(ARI)、归一化互信息(NMI)和ACC这3个评价指标上整体优于BPC和其他7种聚类算法。Belief Peaks Clustering(BPC)algorithm is a new variant of Density Peaks Clustering(DPC)algorithm based on fuzzy perspective.It uses fuzzy mathematics to describe the distribution characteristics and correlation of data.However,BPC algorithm mainly relies on the information of local data points in the calculation of belief values,instead of investigating the distribution and structure of the whole dataset.Moreover,the robustness of the original allocation strategy is weak.To solve these problems,a fuzzy Clustering algorithm based on Belief Subcluster Cutting(BSCC)was proposed by combining belief peaks and spectral method.Firstly,the dataset was divided into many high-purity subclusters by local belief information.Then,the subcluster was regarded as a new sample,and the spectral method was used for cutting graph clustering through the similarity relationship between clusters,thus coupling local information and global information.Finally,the points in the subcluster were assigned to the class cluster where the subcluster was located to complete the final clustering.Compared with BPC algorithm,BSCC has obvious advantages on datasets with multiple subclusters,and it has the ACCuracy(ACC)improvement of 16.38 and 21.35 percentage points on americanflag dataset and Car dataset,respectively.Clustering experimental results on synthetic datasets and real datasets show that BSCC outperforms BPC and the other seven clustering algorithms on the three evaluation indicators of Adjusted Rand Index(ARI),Normalized Mutual Information(NMI)and ACC.
关 键 词:聚类分析 密度峰值聚类 信念峰值聚类 谱聚类 信念子簇 子簇合并
分 类 号:TP181[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222