基于空间平移的K-Means初始簇心选取

K-Means Initial Cluster Center Selection Based on Spatial Translation

作　　者：朱家乐

出　　处：《应用数学进展》2024年第9期4381-4390,共10页Advances in Applied Mathematics

摘　　要：K-means聚类算法因其算法简单、计算效率高,在机器学习、数据挖掘等多个领域得到了广泛应用。然而,传统K-means算法在初始簇心的选取上存在随机性,这可能导致聚类结果的不稳定性。为了解决这一问题,本研究提出了一种基于空间平移的初始簇心选取算法。该算法首先将包含所有样本集的最小空间通过单位空间以一定步长遍历,在单位空间内统计样本点的密度,以此降低计算量。通过逐一选出密度最高的个点作为初始簇心,从而提高了K-means算法的聚类性能。在UCI的12种数据集上进行的实验表明,与传统的K-means、K-means++等算法相比,改进的算法在迭代次数上有所降低,聚类准确率得到了显著提高。K-means clustering algorithm is an important content in the field of machine learning and is widely used because of its simplicity and efficiency. In order to solve the problem that the initial cluster center selection of traditional K-means algorithm is random, an initial cluster center selection algorithm based on space segmentation is proposed. The minimum space containing all sample sets is divided to calculate the density, and the initial cluster centers with the highest density are selected one by one. The selected cluster centers are replaced by random initial cluster centers for K-means clustering. Twelve datasets were tested separately at UCI. The experimental results show that compared with traditional K-means, K-means++ and other algorithms, the improved algorithm has lower iteration times and higher clustering accuracy.

关键词：K-MEANS 初始聚类中心密度空间平移

分类号：TP3[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于空间平移的K-Means初始簇心选取

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于空间平移的K-Means初始簇心选取

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索