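Heterogeneous Parallel Simulation and Performance Optimization of High-Order Accurate CFD Applications on the Tianhe-2 System  Cited by: 5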

Heterogeneous Computing and Optimization on Tianhe-2 Supercomputer System for High-Order Accurate CFD Applications


Authors: Wang Yongxian (王勇献)[1,2], Zhang Lilun (张理论)[1,2], Che Yonggang (车永刚)[1,2], Xu Chuanfu (徐传福)[1], Liu Wei (刘巍)[1], Cheng Xinghua (程兴华)[1]

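Affiliations: [1] College of Computer, National University of Defense Technology, Changsha 410073; [2] Key Laboratory of Parallel and Distributed Processing, National University of Defense Technology, Changsha 410073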

Source: Journal of Computer Research and Development (《计算机研究与发展》), 2015, No. 4, pp. 833-842 (10 pages)

Funding: National Natural Science Foundation of China (61379056; 11272352); National Basic Research Program of China ("973" Program) (2009CB723803)

Abstract: Efficient parallel simulation of very-large-scale computational fluid dynamics (CFD) applications on today's mainstream many-core heterogeneous high-performance computing (HPC) platforms, such as Tianhe-2, still faces a series of challenging technical problems and is one of the research hotspots in this field. Targeting the Tianhe-2 heterogeneous parallel platform, this paper explores efficient parallelization of a high-order accurate CFD flow-field simulation code, focusing on performance optimization strategies that match the characteristics of CFD applications to the features of many-core heterogeneous HPC platforms. A series of performance-improving techniques is proposed, covering task decomposition, exploitation of parallelism, multi-threading optimization, single-instruction multiple-data (SIMD) vectorization, and cooperative optimization between CPUs and accelerators. Several test cases were simulated on Tianhe-2; the largest reached 1.228 × 10^11 (122.8 billion) grid points and used about 590,000 CPU and MIC processor cores in total. To the authors' best knowledge, a high-order accurate CFD simulation at this scale had not been attempted before. The results show that the ported and optimized code runs about 2.6 times faster on the hybrid CPU and coprocessor platform than on the CPU-only platform, with good scalability. The present work redefines the frontier of high-performance computing for fluid dynamics simulations on heterogeneous platforms.

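Keywords: computational fluid dynamics; high-order accurate scheme; parallel computing; CPU+MIC heterogeneous cooperative parallelism; performance optimization; Tianhe-2 supercomputer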

Classification number: TP301 [Automation and Computer Technology: Computer System Architecture]

 
