混合属性数据聚类初始点选择的改进被引量：3

Improved Clustering Algorithm for Mixed Numeric and Categorical Values

出　　处：《广西师范大学学报（自然科学版）》2007年第4期220-223,共4页Journal of Guangxi Normal University:Natural Science Edition

基　　金：国家自然科学基金资助项目(70171033);江苏省高校自然科学基础研究基金资助项目(07KJ520216)

摘　　要：k-prototypes和模糊k-prototypes是处理数值属性和分类属性混合数据主要的聚类算法。但这两种聚类算法不足之处是对初值有明显的依赖。对初值选取方法进行了分析和研究,提出一种新的改进方法,可在一定程度上减少随机性。实际数据集仿真结果表明改进算法有更高的稳定性和较强的伸缩性。The k-prototypes algorithm and Fuzzy k-prototypes algorithm have become popular technique in solving categorical data clustering problems in different application domains. However, they also reuires random selection of initial points for the clusters. So it is obvious that outputs are especially sensitive to initial. Different initial points often lead to considerable distinct clustering results. This paper analyses the method of random selection and proposes a method of searching initial starting points through grouping data sets. Experiments show that the new initialization method leads to higher stability and flexibility.

关键词：聚类 k—modes k—prototypes 分类型数据相异度

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

混合属性数据聚类初始点选择的改进被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

混合属性数据聚类初始点选择的改进 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

混合属性数据聚类初始点选择的改进被引量：3