云计算中Hadoop技术研究与应用综述  被引量:76

Review of Research and Application on Hadoop in Cloud Computing

在线阅读下载全文

作  者:夏靖波[1] 韦泽鲲 付凯[1] 陈珍[1] 

机构地区:[1]空军工程大学信息与导航学院,西安710077

出  处:《计算机科学》2016年第11期6-11,48,共7页Computer Science

基  金:陕西省自然科学基金项目(2012JZ8005)资助

摘  要:Hadoop作为当今云计算与大数据时代背景下最热门的技术之一,其相关生态圈与Spark技术的结合一同影响着学术发展和商业模式。首先介绍了Hadoop的起源和优势,阐明相关技术原理,如MapReduce,HDFS,YARN,Spark等;然后着重分析了当前Hadoop学术研究成果,从MapReduce算法的改进与创新、HDFS技术的优化与创新、二次开发与其它技术相结合、应用领域创新与实践4个方面进行总结,并简述了国内外应用现状。而Hadoop与Spark结合是未来的趋势,最后展望了Hadoop未来研究的发展方向和亟需解决的问题。Hadoop is one of the most popular technologies in the area of cloud computing and big data nowadays,the combination of its relevant software ecosystem with Spark technology influences the academic development and business model.This paper firstly introduced the origin and advantages of Hadoop,and clarified the relevant technical principles,such as MapReduce,HDFS,YARN,Spark and so on.Then we focused on the analysis of the current Hadoop academic research achievements,and summarized four aspects:the improvement and innovation of the MapReduce algorithm,optimization and innovation of technology of HDFS,secondary development and other combination,innovation and practice of application field.And then the developing situation of domestic and foreign application was described.Hadoop with the Spark is the trend of the future.This paper finally discussed the development direction of the future research and some crucial problems which should be solved pressingly.

关 键 词:云计算 大数据 HADOOP SPARK MAPREDUCE 

分 类 号:TP391[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象