Research on Parallel Acceleration of OpenFOAM Threads for Domestic Accelerator (面向国产加速卡的OpenFOAM线程并行加速研究)


Authors: SHANG Xiaomin; LI Qiang; GAO Lingyun; TAO Shunan; ZHOU Quan; YUAN Wu [2,3]; LU Zhonghua

Affiliations: [1] School of Computer Science and Technology, Qingdao University, Qingdao, Shandong 266071, China; [2] Computer Network Information Center, Chinese Academy of Sciences, Beijing 100083, China; [3] University of Chinese Academy of Sciences, Beijing 100049, China

Source: Frontiers of Data & Computing (《数据与计算发展前沿》), 2024, Issue 2, pp. 134-144 (11 pages)

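Funding: National Key Research and Development Program of China (2020YFB1709500); Shandong Provincial Natural Science Foundation, General Program (ZR201910310143).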

Abstract: [Background] As fluid dynamics simulations become more refined, the computational fluid dynamics (CFD) software OpenFOAM demands ever more computing power. The new “East” supercomputing system is a domestically developed heterogeneous supercomputer. [Purpose] This work ports OpenFOAM to the new “East” supercomputing system in order to adapt OpenFOAM to a domestic supercomputer and accelerate it there. [Methods] First, based on an analysis of the functional architectures of the “East” supercomputing system and OpenFOAM, a solver suited to the domestic accelerator card is developed; a ported version of CUSP calls the accelerator's low-level code to perform sparse matrix-vector multiplication (SpMV) on matrices in sparse storage formats, as well as diagonal preconditioning. Second, building on this, parallel SpMV across multiple domestic accelerator cards on a single node is implemented. [Results] The approach is validated with the pitzDaily case that ships with OpenFOAM, and its performance is analyzed using several speedup comparison methods, showing a 19.7-fold speedup. [Limitations] This study implements parallel optimization of OpenFOAM on a single node only. [Conclusions] The results are significant for exploiting OpenFOAM's strengths in fluid mechanics and for broadening the range of software that runs on supercomputers.

Keywords: OpenFOAM; parallel computing; heterogeneous computing; domestic accelerator card; porting and optimization

Classification code: TP338.4 [Automation and Computer Technology: Computer System Architecture]
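
The abstract names three numerical building blocks: sparse matrix-vector multiplication (SpMV) on a sparse storage format, diagonal preconditioning, and SpMV split across several accelerator cards on one node. The record does not give the implementation, so the following is only a minimal CPU sketch of those ideas in plain Python. The CSR format, the contiguous row-block partitioning, and all function names (`csr_spmv`, `split_rows`, `partitioned_spmv`, `jacobi_apply`) are assumptions made for illustration; they are not taken from the paper, from CUSP, or from OpenFOAM.

```python
def csr_spmv(indptr, indices, data, x, row_start=0, row_end=None):
    """Return rows [row_start, row_end) of y = A @ x, with A stored in CSR."""
    if row_end is None:
        row_end = len(indptr) - 1
    y = []
    for row in range(row_start, row_end):
        acc = 0.0
        # Nonzeros of this row live in data[indptr[row]:indptr[row + 1]].
        for k in range(indptr[row], indptr[row + 1]):
            acc += data[k] * x[indices[k]]
        y.append(acc)
    return y


def split_rows(n_rows, n_parts):
    """Split the row range into n_parts contiguous, near-equal blocks."""
    base, extra = divmod(n_rows, n_parts)
    bounds, start = [], 0
    for part in range(n_parts):
        end = start + base + (1 if part < extra else 0)
        bounds.append((start, end))
        start = end
    return bounds


def partitioned_spmv(indptr, indices, data, x, n_parts):
    """Compute y = A @ x one row block at a time.

    Each block stands in for the work one accelerator card might receive
    (hypothetical scheme); here the blocks simply run sequentially on the CPU.
    """
    y = []
    for start, end in split_rows(len(indptr) - 1, n_parts):
        y.extend(csr_spmv(indptr, indices, data, x, start, end))
    return y


def jacobi_apply(indptr, indices, data, r):
    """Apply a diagonal (Jacobi) preconditioner: z[i] = r[i] / A[i][i]."""
    z = []
    for row in range(len(indptr) - 1):
        diag = 0.0
        for k in range(indptr[row], indptr[row + 1]):
            if indices[k] == row:
                diag = data[k]
        z.append(r[row] / diag)  # assumes a nonzero diagonal entry
    return z


# Illustrative 3x3 matrix [[4, -1, 0], [-1, 4, -1], [0, -1, 4]] in CSR form.
INDPTR = [0, 2, 5, 7]
INDICES = [0, 1, 0, 1, 2, 1, 2]
DATA = [4.0, -1.0, -1.0, 4.0, -1.0, -1.0, 4.0]
```

For example, `partitioned_spmv(INDPTR, INDICES, DATA, [1.0, 2.0, 3.0], 2)` assigns rows 0-1 to the first block and row 2 to the second, and returns the same vector as a single `csr_spmv` call over all rows. Row-block splitting is used here because each output entry depends on only one matrix row, so the blocks need no coordination beyond sharing the input vector.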

 
