检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:DENG Liang ZHAO Dan BAI Hanli WANG Fang
机构地区:[1]Computational Aerodynamics Institue, China Aerodynamics Research and Development Center [2]Collaborative Innovation Center of High Performance Computing, National University of Denfense Technology [3]School of Computer, National University of Denfense Technology
出 处:《Chinese Journal of Electronics》2018年第3期540-548,共9页电子学报(英文版)
基 金:the National Key Research and Development Program of China(No.2016YFB0200703);the National Natural Science Foundation of China(No.61379056)
摘 要:We accelerate a double precision Alternating direction implicit(ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house Computational fluid dynamics(CFD) software on the latest multi-core and many-core architectures(Intel Sandy Bridge CPUs, Intel Many integrated core(MIC) coprocessors and NVIDIA Kepler K20c GPUs). Some performance optimization techniques are detailed discussed. We provide an in-depth analysis on the performance difference between Sandy Bridge and MIC. Experimental results show that the proposed GPU-enabled ADI solver can achieve a speedup of 5.5 on a Kepler GPU in contrast to two Sandy Bridge CPUs and our optimization techniques can improve the performance of the ADI solver by 2.5-fold on two Sandy Bridge CPUs and 1.7-fold on an Intel MIC coprocessor. We perform a cross-platform performance analysis(between GPU and MIC), which serves as case studies for developers to select the right accelerators for their target applications.We accelerate a double precision Alternating direction implicit(ADI) solver for three-dimensional compressible Navier-Stokes equations from our in-house Computational fluid dynamics(CFD) software on the latest multi-core and many-core architectures(Intel Sandy Bridge CPUs, Intel Many integrated core(MIC) coprocessors and NVIDIA Kepler K20c GPUs). Some performance optimization techniques are detailed discussed. We provide an in-depth analysis on the performance difference between Sandy Bridge and MIC. Experimental results show that the proposed GPU-enabled ADI solver can achieve a speedup of 5.5 on a Kepler GPU in contrast to two Sandy Bridge CPUs and our optimization techniques can improve the performance of the ADI solver by 2.5-fold on two Sandy Bridge CPUs and 1.7-fold on an Intel MIC coprocessor. We perform a cross-platform performance analysis(between GPU and MIC), which serves as case studies for developers to select the right accelerators for their target applications.
关 键 词:Multi-block Structured grid Alternating direction implicit(ADI) CFD solver Sandy bridge MIC GPU Optimization techniques Performance comparison
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.143.209.210