基于Hadoop的改进型遗传聚类算法被引量：1

Improved Genetic Clustering Algorithm Based on Hadoop

作　　者：潘俊辉[1] 王辉[1] 张强[1] 王浩畅[1] PAN Jun-Hui;WANG Hui;ZHANG Qiang;WANG Hao-Chang(School of Computer and Information Technology,Northeast Petroleum University,Daqing 163318,China)

机构地区：[1]东北石油大学计算机与信息技术学院,大庆163318

出　　处：《计算机系统应用》2021年第9期242-246,共5页Computer Systems & Applications

基　　金：国家自然科学基金(61702093);东北石油大学青年科学基金(2020QNL-02)。

摘　　要：针对经典K-means聚类算法存在易陷入局部最优解的缺点,提出并实现了一种基于Hadoop的改进型遗传聚类算法.该算法利用遗传算法具有全局性和并行性的特点去处理K-means聚类算法易陷入局部最优的缺点,在此基础上对遗传算法进行改进,然后将改进后的遗传算法与K-means算法相结合,为提高算法执行效率,将其基于Hadoop平台进行了实现.通过实验将该改进方法与经典聚类算法进行对比分析,实验结果表明该方法在聚类准确性和聚类效率上均有较大的提高.Concerning the shortcoming that the classical K-means clustering algorithm is easy to fall into the local optimum,an improved genetic clustering algorithm based on Hadoop is proposed and implemented.The algorithm overcomes the above shortcoming with the globality and parallelism of the genetic algorithm.On this basis,the genetic algorithm is improved and then combined with the classical K-means algorithm.To improve the implementation efficiency,we implement the improved genetic clustering algorithm on Hadoop.The proposed method is compared with the classical clustering algorithm through experiments.The results show that the proposed method can greatly improve the clustering accuracy and efficiency.

关键词：K-MEANS 文本聚类遗传算法 HADOOP 并行性

分类号：TP18[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于Hadoop的改进型遗传聚类算法被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于Hadoop的改进型遗传聚类算法 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于Hadoop的改进型遗传聚类算法被引量：1