基于密度和最近邻的K-means文本聚类算法被引量：29

K-means text clustering algorithm based on density and nearest neighbor

出　　处：《计算机应用》2010年第7期1933-1935,共3页journal of Computer Applications

基　　金：西北大学科研启动基金资助项目(PR08067);西北大学研究生自主创新基金资助项目(08YZZ35)

摘　　要：初始中心点的选择对于传统的K-means算法聚类结果影响较大,容易使聚类陷入局部最优解。针对这个问题,引入密度和最近邻思想,提出了生成初始聚类中心的算法Initial。将所选聚类中心用于K-means算法,得到了更好的应用于文本聚类的DN-K-means算法。实验结果表明,该算法可以生成聚类质量较高并且稳定性较好的结果。The selection of initial focal point has great influence on the clustering results of traditional K-means algorithm,for it tends to get a local optimal solution when inappropriately assigned.In view of this issue,initial algorithm that can generate the initial cluster center was proposed,through introducing the density and nearest neighbor idea.These selected centers were used for K-means algorithm;a better text clustering algorithm called DN-K-means was put forward.The results of experiments indicate that the algorithm can lead to results with high and steady clustering quality.

关键词：文本聚类密度最近邻 F度量

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于密度和最近邻的K-means文本聚类算法被引量：29

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于密度和最近邻的K-means文本聚类算法 被引量：29

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于密度和最近邻的K-means文本聚类算法被引量：29