基于结构相似性的k-modes算法被引量：2

k-modes algorithm based on structural similarity

机构地区：[1]广东工业大学应用数学学院,广州510520 [2]佛山科技技术学院数学与大数据学院,广东佛山528000 [3]广东工业大学计算机学院,广州510006

出　　处：《计算机工程与应用》2017年第23期102-107,共6页Computer Engineering and Applications

基　　金：国家自然科学基金(No.61472089);广东省自然科学基金(No.2014A030308008);软件新技术国家重点实验室开放课题(No.KFKT2014B23)

摘　　要：聚类是数据挖掘中重要的技术之一,它是按照相似原则将数据进行分类。然而分类型数据的聚类是学习算法中重要而又棘手的问题。传统的k-modes算法采用简单的0-1匹配方法定义两个属性值之间的相异度,没有将整个数据集的分布考虑进来,导致差异性度量不够准确。针对这个问题,提出基于结构相似性的k-modes算法。该算法不仅考虑属性值它们本身的异同,而且考虑了它们在其他属性下所处的结构。从集群识别和准确率两个方面进行仿真实验,表明基于结构相似性的k-modes算法在伸缩性和准确率方面更有效。Clustering is one of the important technology in data mining, which is based on similar principles to classify data. However, categorical data clustering is an important and difficult issue among many learning algorithms. The traditional k-modes algorithm uses a simple 0-1 matching method to define dissimilarity between two attribute values, does not take the distribution of the entire data set into account, which results in inaccurate measurement differences. Aiming at this problem, a k-modes algorithm based on structure similarity is proposed. The algorithm not only considers the attribute values of their own similarities and differences, but also considers the structure of them in other attributes. The simulation results from two aspects of cluster identification and accuracy show that the k-modes algorithm based on structure similarity is more effective in scalability and accuracy.

关键词：聚类分析分类型数据相异度度量结构相似性 k-modes算法

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于结构相似性的k-modes算法被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于结构相似性的k-modes算法 被引量：2

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于结构相似性的k-modes算法被引量：2