ExaHDF5: Delivering Efficient Parallel I/O on Exascale Computing Systems  被引量:1

在线阅读下载全文

作  者:Suren Byna M.Scot Breitenfeld Bin Dong Quincey Koziol Elena Pourmal Dana Robinson Jerome Soumagne Houjun Tang Venkatram Vishwanath Richard Warren 

机构地区:[1]Lawrence Berkeley National Laboratory,Berkeley,CA 94597,U.S.A [2]The HDF Group,Champaign,IL 61820,U.S.A [3]Argonne National Laboratory,Lemont,IL 60439,U.S.A

出  处:《Journal of Computer Science & Technology》2020年第1期145-160,共16页计算机科学技术学报(英文版)

基  金:This research was supported by the Exascale Computing Project under Grant No.17-SC-20-SC;a joint project of the U.S.Department of Energy's Office of Science and National Nuclear Security Administration,responsible for delivering a capable exascale ecosystem,including software,applications,and hardware technology,to support the nation's exascale computing imperative;This work is also supported by the Director,Office of Science,Office of Advanced Scientific Computing Research,of the U.S.Department of Energy under Contract Nos.DE-AC02-05CH11231 and DE-AC02-06CH11357;This research was funded in part by the Argonne Leadership Computing Facility;which is a DOE Office of Science User Facility supported under Contract No.DE-AC02-06CH11357;This research used resources of the National Energy Research Scientific Computing Center;which is DOE Office of Science User Facilities supported by the Office of Science of the U.S.Department of Energy under Contract No.DE-AC02-05CH11231.

摘  要:Scientific applications at exascale generate and analyze massive amounts of data.A critical requirement of these applications is the capability to access and manage this data efficiently on exascale systems.Parallel I/O,the key technology enables moving data between compute nodes and storage,faces monumental challenges from new applications,memory,and storage architectures considered in the designs of exascale systems.As the storage hierarchy is expanding to include node-local persistent memory,burst buffers,etc.,as well as disk-based storage,data movement among these layers must be efficient.Parallel I/O libraries of the future should be capable of handling file sizes of many terabytes and beyond.In this paper,we describe new capabilities we have developed in Hierarchical Data Format version 5(HDF5),the most popular parallel I/O library for scientific applications.HDF5 is one of the most used libraries at the leadership computing facilities for performing parallel I/O on existing HPC systems.The state-of-the-art features we describe include:Virtual Object Layer(VOL),Data Elevator,asynchronous I/O,full-featured single-writer and multiple-reader(Full SWMR),and parallel querying.In this paper,we introduce these features,their implementations,and the performance and feature benefits to applications and other libraries.

关 键 词:parallel I/O Hierarchical Data FORMAT version 5(HDF5) I/O performance virtual OBJECT layer HDF5 OPTIMIZATIONS 

分 类 号:TP31[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象