检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:孙璐 梁永全 SUN Lu;LIANG Yongquan(School of Computer Science and Engineering,Shandong University of Science and Technology,Qingdao,Shandong 266590,China)
机构地区:[1]山东科技大学计算机科学与工程学院,山东青岛266590
出 处:《计算机工程与应用》2022年第14期73-79,共7页Computer Engineering and Applications
基 金:国家自然科学基金(91746104)。
摘 要:针对基于密度的噪声应用空间聚类算法(density based spatial clustering of applications with noise,DBSCAN)计算复杂度较高以及无法聚类多密度数据集等问题,提出了一种网格聚类算法和DBSCAN相结合的融合聚类算法(G_FDBSCAN)。利用网格划分技术将数据集划分为稀疏区域和密集区域,分而治之,降低计算的时间复杂度和采用全局参数引起的聚类误差;改进传统的DBSCAN聚算法得到FDBSCAN,将密集区域中网格聚类的结果作为一个整体参与后续的聚类,在网格划分基础上进行邻域检索,减少邻域检索和类扩展过程中对象的无效查询和重复查询,进一步减少时间开销。理论分析和实验测试表明,改进后的算法与DBSCAN算法、DPC算法、KMEANS算法、BIRCH算法和CBSCAN算法相比,在聚类结果接近或达到最优的情况下,聚类效率分别平均提升了24倍、11倍、2倍、3倍和1倍。Aiming at the high computational complexity of density based spatial clustering of applications with noise(DBSCAN),as well the inability to cluster multi-density datasets,a fusion clustering algorithm(G_FDBSCAN)combin-ing the grid clustering algorithm and DBSCAN is proposed.The new algorithm introduces grid division to divide the data-set into sparse areas and dense areas for processing respectively,so as to reduce the time complexity of calculation and the clustering error caused by global parameters.Then,it improves the traditional DBSCAN clustering algorithm to obtain FDBSCAN to take the results of grid clustering in dense areas as a whole to participate in the subsequent clustering,and carries out neighborhood retrieval on the basis of grid division,so as to reduce the invalid query and repeated query of objects in the process of neighborhood retrieval and class expansion,which further reduces the time overhead.Theoretical analysis and experimental tests show that compared with DBSCAN algorithm,DPC algorithm,KMEANS algorithm,BIRCH algorithm and CBSCAN algorithm,when the clustering results are optimal or close to,the clustering efficiency is increased by 24 times,11times,2 times,3 times and1 time respectively.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222