基于面向对象对CUDA架构的LBM计算程序优化

Optimization of LBM computing program based on object-oriented CUDA architecture

作　　者：韦华健张乾毅张静静李华兵[1] WEI Huajian;ZHANG Qianyi;ZHANG Jingjing;LI Huabing(School of Materials Science and Engineering,Guilin University of Electronic Technology,Guilin 541004,China)

机构地区：[1]桂林电子科技大学材料科学与工程学院,广西桂林541004

出　　处：《桂林电子科技大学学报》2024年第6期579-584,共6页Journal of Guilin University of Electronic Technology

基　　金：国家自然科学基金(11362005)。

摘　　要：晶格玻尔兹曼方法(LBM)是一种新颖而有前途的计算流体力学方法,从算法的角度看,其迭代过程能被分化为多个子问题的并行程序,非常适合在高性能图像处理器(GPU)计算,获得极快的数据处理速度,同时有大量工作报告了基于GPU计算的LBM方法得到了高效实现。程序环境以C++编程语言,运用面向对象思想优化CUDA程序结构,可减少程序的耦合性,赋予程序的可持续发展能力;使用Poiseuille flow模型验证优化程序的稳定性与准确性。在程序运行过程中,调用CUDA内核函数来处理模型内的碰撞、迁徙流动、计算宏观量的迭代过程,同时使用共享内存储存GPU运行时的数据,以提高计算效率。数据分析结果表明,计算速度较中央处理器(CPU)提升了70倍,这归功于GPU高性能的并行计算能力。Lattice Boltzmann method(LBM)is a novel and promising computational fluid dynamics method,which has natural advantages.From the perspective of algorithm,the iterative process can be divided into parallel programs with multiple subproblems.In order to obtain extremely fast data processing speed,the iterative process is computed by high-performance graphics processing unit(GPU).At the same time,the efficient implementation of GPU-based LBM method has been widely reported,so it is very suitable for high performance image processor(GPU)calculation to obtain extremely fast data processing speed.The program environment is C++as the programming language,the CUDA program structure is optimized by object-oriented thinking,the coupling of the program is reduced,and the sustainable development of the program is endowed.Poiseuille flow model is used to verify the stability and accuracy of the optimization program.During the program running,CUDA kernel functions are called to deal with the collision within the model,migration flow and iterative process of calculating macro quantities.Meanwhile,shared memory is used to store GPU runtime data to improve computing efficiency.Analysis of the data show that computing speeds are up to 70 times faster than those of the central processing unit(CPU),thanks to the GPU's high-performance parallel computing capabilities.

关键词：晶格玻尔兹曼方法面向对象 Poiseuille flow模型 CUDA

分类号：O414.2[理学—理论物理]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于面向对象对CUDA架构的LBM计算程序优化

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于面向对象对CUDA架构的LBM计算程序优化

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索