Research and Application of Parallel Optimization in High Performance Computing Based on OpenACC  (Cited by: 3)


Authors: GU Wen-jing (顾文静) [1]; SUN Chen (孙晨); WANG Bin (王彬) [1] (High Performance Computing Division, National Meteorological Information Center, Beijing 100081, China)

Affiliation: [1] High Performance Computing Division, National Meteorological Information Center, Beijing 100081, China

Source: Computer Technology and Development (《计算机技术与发展》), 2018, No. 4, pp. 65-70 (6 pages)

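Funding: National Key Research and Development Program of China (2016YFA0602102); Special Fund for Scientific Research in the Public Interest (Meteorology) (GYHY201306062); China Meteorological Administration bureau-university cooperation project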

Abstract: GPU acceleration suffers from complex coding and poor portability, which make development and maintenance inefficient. To address this, directive-based OpenACC acceleration is used to rewrite traditional serial code, improving development efficiency and simplifying the code. Taking the long-wave radiation process of the GRAPES global model as the research object, program performance is first tuned through compiler options. Then, according to the data dependences and memory-access characteristics of the code, the data and loop structures are preprocessed and OpenACC directives are added to achieve loop-level parallelism. Experiments show that the parallel long-wave radiation computation produces correct results and achieves a speedup of 4 to 6 times without changing the original code structure. The optimized performance is comparable to that of an Intel cluster with the same computing capability. Although a gap remains relative to GPU acceleration, the readability and portability of the code are greatly improved, and as compilers and hardware continue to develop, OpenACC has broad room for growth.
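As a rough illustration of the loop-level parallelism the abstract describes, the sketch below annotates a nested loop with OpenACC directives. The function `column_flux`, its arguments, the flattened array layout, and the toy accumulation formula are all hypothetical and are not taken from the paper's code; the choice of C is also an assumption made for the example. The outer loop over columns has no cross-iteration dependence and is marked parallel, while the inner loop over levels carries a running sum and is kept sequential.

```c
#include <assert.h>

/* Hypothetical kernel, not the paper's radiation code.
 * temp and flux are flattened [ncol][nlev] arrays.
 * Columns are independent; within a column, each level depends on the
 * running sum from the previous level, so the inner loop stays sequential. */
void column_flux(int ncol, int nlev,
                 const double *restrict temp, double *restrict flux)
{
    assert(ncol > 0 && nlev > 0);

    /* Data clauses state the host/device transfers explicitly.
     * A compiler without OpenACC support ignores the pragmas and
     * runs the loops serially with the same result. */
    #pragma acc parallel loop copyin(temp[0:ncol*nlev]) copyout(flux[0:ncol*nlev])
    for (int c = 0; c < ncol; ++c) {
        double acc = 0.0;  /* private to each column iteration */

        #pragma acc loop seq
        for (int k = 0; k < nlev; ++k) {
            acc += 0.5 * temp[c * nlev + k];  /* toy formula, illustrative only */
            flux[c * nlev + k] = acc;
        }
    }
}
```

The loop body is the same as the serial version; only the directives are new, which is the property the abstract credits for better readability and portability.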

Keywords: Sunway TaihuLight (神威·太湖之光); OpenACC; GRAPES model; long-wave radiation process

Classification: TP301 [Automation and Computer Technology: Computer System Architecture]

 
