Evaluation of the computational performance of the finite-volume atmospheric model of the IAP/LASG (FAMIL) on a high-performance computer  (Cited by: 9)

Authors: LI Jin-Xiao, BAO Qing, LIU Yi-Min, WU Guo-Xiong

Affiliations: [1] State Key Laboratory of Numerical Modeling for Atmospheric Sciences and Geophysical Fluid Dynamics (LASG), Institute of Atmospheric Physics, Chinese Academy of Sciences; [2] College of Earth Science, University of Chinese Academy of Sciences

Source: Atmospheric and Oceanic Science Letters, 2017, Issue 4, pp. 329-336 (8 pages)

Funding: Supported by the National Natural Science Foundation of China [grant numbers 41675100 and 91337110]; the Third Tibetan Plateau Scientific Experiment: Observations for Boundary Layer and Troposphere [GYHY201406001]; the Key Research Program of Frontier Sciences, Chinese Academy of Sciences (CAS) (QYZDY-SSW-DQC018); and the Special Program for Applied Research on Super Computation of the NSFC-Guangdong Joint Fund (the second phase)

Abstract: High computational performance is extremely important for climate system models, especially in the development of ultra-high-resolution models. In this study, the computational performance of the Finite-volume Atmospheric Model of the IAP/LASG (FAMIL) was comprehensively evaluated on Tianhe-2, which was the world's top-ranked supercomputer from June 2013 to May 2016. A standard Atmospheric Model Intercomparison Project (AMIP)-type experiment was carried out, focusing on the computational performance of each node as well as the simulated years per day (SYPD), the running cost speedup, and the scalability of FAMIL. The results indicate that (1) based on five indexes (CPU usage; the percentage of CPU time spent in kernel mode and waiting on message passing (CPU SW); code vectorization (VEC); average Gflops (Gflops_AVE); and peak Gflops (Gflops_PK)), FAMIL shows excellent computational performance on every Tianhe-2 computing node; (2) considering SYPD and the cost speedup together, the optimal number of Message Passing Interface (MPI) processors (MNPs) is 384 for C96 (100 km) and 1536 for C384 (25 km); and (3) FAMIL scales well as more threads are used to drive the model. Given the fast network and the Many Integrated Core (MIC) accelerator cards available on Tianhe-2, there is still significant room to improve the computational performance of FAMIL.
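The throughput and cost metrics named in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper's method: the record does not give the authors' exact definition of "running cost speedup", so core-hours per simulated year is used here as a common stand-in for cost, and all function names and timings are hypothetical.

```python
SECONDS_PER_DAY = 86_400.0


def sypd(simulated_years: float, wallclock_seconds: float) -> float:
    """Simulated years per wall-clock day (SYPD)."""
    return simulated_years * SECONDS_PER_DAY / wallclock_seconds


def speedup(base_seconds: float, seconds: float) -> float:
    """Wall-clock speedup relative to a baseline run of the same experiment."""
    return base_seconds / seconds


def parallel_efficiency(base_procs: int, base_seconds: float,
                        procs: int, seconds: float) -> float:
    """Speedup divided by the increase in processor count (1.0 is ideal)."""
    return speedup(base_seconds, seconds) * base_procs / procs


def core_hours_per_simulated_year(procs: int, wallclock_seconds: float,
                                  simulated_years: float) -> float:
    """Assumed cost proxy: processor-hours spent per simulated year."""
    return procs * wallclock_seconds / 3600.0 / simulated_years


if __name__ == "__main__":
    # Hypothetical timings for one simulated year; NOT taken from the paper.
    runs = {96: 14_400.0, 384: 4_320.0}
    base_procs, base_seconds = 96, runs[96]
    for procs, seconds in runs.items():
        print(
            procs,
            round(sypd(1.0, seconds), 2),
            round(parallel_efficiency(base_procs, base_seconds, procs, seconds), 3),
            round(core_hours_per_simulated_year(procs, seconds, 1.0), 1),
        )
```

Read together, these quantities show the trade-off the abstract describes: adding processors raises SYPD, but once parallel efficiency drops, each simulated year costs more core-hours, which is why an intermediate processor count can be the best choice.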

Keywords: FAMIL; scalability; computational performance; Tianhe-2

Classification: P409 [Astronomy and Earth Sciences: Atmospheric Science and Meteorology]; P435

 
