Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures  

Performance Optimization and Comparison of the Alternating Direction Implicit CFD Solver on Multi-core and Many-Core Architectures

在线阅读下载全文

作  者:DENG Liang ZHAO Dan BAI Hanli WANG Fang 

机构地区:[1]Computational Aerodynamics Institue, China Aerodynamics Research and Development Center [2]Collaborative Innovation Center of High Performance Computing, National University of Denfense Technology [3]School of Computer, National University of Denfense Technology

出  处:《Chinese Journal of Electronics》2018年第3期540-548,共9页电子学报(英文版)

基  金:the National Key Research and Development Program of China(No.2016YFB0200703);the National Natural Science Foundation of China(No.61379056)

摘  要:We accelerate a double precision Alternating direction implicit(ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house Computational fluid dynamics(CFD) software on the latest multi-core and many-core architectures(Intel Sandy Bridge CPUs, Intel Many integrated core(MIC) coprocessors and NVIDIA Kepler K20c GPUs). Some performance optimization techniques are detailed discussed. We provide an in-depth analysis on the performance difference between Sandy Bridge and MIC. Experimental results show that the proposed GPU-enabled ADI solver can achieve a speedup of 5.5 on a Kepler GPU in contrast to two Sandy Bridge CPUs and our optimization techniques can improve the performance of the ADI solver by 2.5-fold on two Sandy Bridge CPUs and 1.7-fold on an Intel MIC coprocessor. We perform a cross-platform performance analysis(between GPU and MIC), which serves as case studies for developers to select the right accelerators for their target applications.We accelerate a double precision Alternating direction implicit(ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house Computational fluid dynamics(CFD) software on the latest multi-core and many-core architectures(Intel Sandy Bridge CPUs, Intel Many integrated core(MIC) coprocessors and NVIDIA Kepler K20c GPUs). Some performance optimization techniques are detailed discussed. We provide an in-depth analysis on the performance difference between Sandy Bridge and MIC. Experimental results show that the proposed GPU-enabled ADI solver can achieve a speedup of 5.5 on a Kepler GPU in contrast to two Sandy Bridge CPUs and our optimization techniques can improve the performance of the ADI solver by 2.5-fold on two Sandy Bridge CPUs and 1.7-fold on an Intel MIC coprocessor. We perform a cross-platform performance analysis(between GPU and MIC), which serves as case studies for developers to select the right accelerators for their target applications.

关 键 词:Multi-block Structured grid Alternating direction implicit(ADI) CFD solver Sandy bridge MIC GPU Optimization techniques Performance comparison 

分 类 号:TP332[自动化与计算机技术—计算机系统结构] O572.131[自动化与计算机技术—计算机科学与技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象