检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《计算机工程与设计》2017年第4期1019-1023,共5页Computer Engineering and Design
基 金:国家自然科学基金项目(60975050);中央高校基本科研业务费专项基金项目(2452015197;2452015194;2452015200)
摘 要:传统K-means算法中初始质心选定的随机性可能使算法陷入局部最优解,使聚类结果不够准确。改进初始质心的选择算法,为各样本点引入局部密度指标,根据其局部密度分布情况,选取处于密度峰值的点作为初始质心,得到稳定的离收敛质心很近的初始质心,减少算法迭代次数,提高运行效率,降低陷入局部最优的概率,显著提高聚类准确性。实验结果表明,与几种已有算法相比,该算法在文本聚类中有明显优势。In traditional K-means algorithm, the randomness of the original clustering center is likely to lead to locally optimal so- lution, causing low accuracy clustering result. To improve the selection algorithm of the original clustering center, local density index was introduced, and according to the distribution of the local density, the points at the peak were selected as original clus- tering center, based on which the stable initial clustering center was very close to the convergent center. In this way, the itera- tions of the algorithm were reduced, the operating efficiency was improved, the probability of being involved in the locally opti- mal solution was reduced and the accuracy of the cluster was improved significantly. Experimental results show that compared with the existing algorithms, the proposed method is of prominent advantage in text clustering.
分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.227.102.59