学习特征权值对聚类算法的优化

Optimization of Clustering by Feature Weight Learning

出　　处：《重庆工学院学报（自然科学版）》2008年第5期73-76,共4页Journal of Chongqing Institute of Technology

基　　金：2007年度国家火炬计划资助项目(2007GH010200)

摘　　要：现有的k-均值聚类算法大都是以距离差异为基础的,而同等重要地依赖所有属性的相似性度量会引起误导.传统的k-均值算法选择的相似性度量通常是欧几里德距离的倒数,这种距离通常涉及所有的特征.而在距离公式中引入一些特征权值后,其聚类结果将依赖于这些权值,从而可以通过调整这些权值优化聚类效果.由于k-均值算法是迭代算法,很难直接确定其权值以优化聚类结果,因此提出了一种通过免疫算法学习权值的方法以改进聚类结果.实验结果显示,该方法确定的权重值在提高聚类效果方面是可行的、有效的.The existing k-means clustering algorithms are mostly based on distance difference, but equally depending on the similarity measurement of all attributes in the same degree tends to arouse misleading. The Euclidean distance is usually chosen as the similarity measure in the conventional k-means clustering algorithm, which usually relates to all attributes. When feature weight parameters are introduced to the distance formula, the performance will depend on the weight values and accordingly can be improved by adjusting weight values. Since k-means algorithm is iterative, it is difficult to optimize clustering results by giving weight values directly. A learning feature weight algorithm based on immune algorithm is introduced to improve the clustering result. Experimental results demonstrate that this method of determining the weight values is feasible and efficient in improving the clustering performance.

关键词：特征权值 K-均值算法免疫算法聚类算法

分类号：TP183[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

学习特征权值对聚类算法的优化

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

学习特征权值对聚类算法的优化

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索