基于改进人工蜂群算法与MapReduce的大数据聚类算法  被引量:14

Clustering algorithm of big data based on improved artificial bee colony algorithm and MapReduce

在线阅读下载全文

作  者:孙倩 陈昊 李超 Sun Qian;Chen Hao;Li Chao(Informationization Management Department,Hubei University,Wuhan 430062,China;School of Computer Science&Information Engineering,Hubei University,Wuhan 430062,China)

机构地区:[1]湖北大学信息化建设与管理处,武汉430062 [2]湖北大学计算机与信息工程学院,武汉430062

出  处:《计算机应用研究》2020年第6期1707-1710,1764,共5页Application Research of Computers

基  金:湖北省教育厅科学技术研究重点项目(D20141005)。

摘  要:针对大数据聚类算法计算效率与聚类性能较低的问题,提出了一种基于改进人工蜂群算法与MapReduce的大数据聚类算法。将灰狼优化算法与人工蜂群算法结合,同时提高人工蜂群算法的搜索能力与开发能力,该策略能够有效地提高聚类处理的性能;采用混沌映射与反向学习作为ABC种群的初始化策略,提高搜索的解质量;将聚类算法基于Hadoop的MapReduce编程模型实现,通过最小化类内距离的平方和实现对大数据的聚类处理。实验结果表明,该算法有效地提高了大数据集的聚类质量,同时加快了聚类速度。Aiming at the problems of low computational efficiency and low clustering performance of clustering algorithms for big data,this paper proposed a clustering algorithm of big data based on the improved ABC algorithm and MapReduce. This algorithm combined the grey wolf optimizer algorithm and ABC algorithm,and improved the exploration and exploitation of the ABC algorithm simultaneously,it could help to improve the clustering performance effectively. The algorithm utilized the chaotic map and backward learning as the initial strategy of ABC colony to improve the solution quality of search procedure. It realized the clustering algorithm based on MapReduce programming model,and realized the clustering process for big data by minimizing the quadratic sum of inner class distances. Experimental results demonstrate that the proposed algorithm improves the clustering quality of big data,and speedups the clustering procedure.

关 键 词:数据分析 聚类算法 人工蜂群算法 灰狼优化算法 云计算 分布式计算 

分 类 号:TP301.6[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象