检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:尚小敏 李强 高凌云 陶顺安 周全 袁武[2,3] 陆忠华 SHANG Xiaomin;LI Qiang;GAO Lingyun;TAO Shunan;ZHOU Quan;YUAN Wu;LU Zhonghua(School of Computer Science and Technology,Qingdao University,Qingdao,Shandong 266071,China;Computer Network Information Center,Chinese Academy of Sciences,Beijing 100083,China;University of Chinese Academy of Sciences,Beijing 100049,China)
机构地区:[1]青岛大学计算机科学技术学院,山东青岛266071 [2]中国科学院计算机网络信息中心,北京100083 [3]中国科学院大学,北京100049
出 处:《数据与计算发展前沿》2024年第2期134-144,共11页Frontiers of Data & Computing
基 金:国家重点研发计划(2020YFB1709500);山东省自然科学基金面上项目(ZR201910310143)。
摘 要:【背景】随着流体力学模拟的精细化,CFD流体模拟软件OpenFOAM对算力的需求持续增加,新型东方超级计算系统是国产自研的新型异构超算。【目的】以新型东方超级计算系统为平台移植OpenFOAM,实现OpenFOAM的国产超算适配与加速。【方法】首先,通过分析“东方”超级计算系统和OpenFOAM的功能架构,本文制定了适用于国产加速卡的求解器,通过使用本文移植后的CUSP来调用国产加速卡底层代码从而实现稀疏存储格式矩阵向量乘以及diagonal矩阵预处理。其次,在此基础上,实现了单节点多国产加速卡的并行SpMV。【结果】本文使用了OpenFOAM自带的pitzDaily算例进行算法验证,并通过多种加速性能的对比方法对测试性能进行分析,取得了19.7倍的加速效果。【局限】本研究只实现了单节点的OpenFOAM并行优化。【结论】本研究结果对于发挥OpenFOAM在流体力学方面的优势和扩大超算软件适应面具有重要意义。[Background]With the refinement of computational fluid dynamics simulations,the demand for computing power from CFD software OpenFOAM continues to increase.The new“East”supercomputing system is a new type of home-grown heterogeneous supercomputer to satisfy the demand.[Purpose]OpenFOAM is optimized and accelerated by porting it to the Oriental Supercomputer.[Methods]Firstly,by analyzing the functional architecture of the“East”super-computing system and OpenFOAM,a solver for the domestic accelerator is developed and the ported CUSP is used to call the underlying code of the domestic accelerator to realize the sparse storage format matrix vector multiplication and diagonal matrix preprocessing.Secondly,a parallel SpMV(matrix-vector multiplication)on a single-node card with multiple domestic accelerators is implemented.[Results]The work introduced in this paper is verified by the pitzDaily example provided by OpenFOAM,and the performance is evaluated through comparison with several comparative methods of acceleration,which shows our approach achieves a 19.7 times speedup.[Limitations]This study only realizes a parallel optimization of OpenFOAM on a singlenode.[Conclusions]The results of this study are of great significance for unleashing the advantages of OpenFOAM in fluid mechanics and expanding the scope of the application of supercomputer software.
关 键 词:OPENFOAM 并行计算 异构计算 国产加速卡 移植优化
分 类 号:TP338.4[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.135.190.163