检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:韦华健 张乾毅 张静静 李华兵[1] WEI Huajian;ZHANG Qianyi;ZHANG Jingjing;LI Huabing(School of Materials Science and Engineering,Guilin University of Electronic Technology,Guilin 541004,China)
机构地区:[1]桂林电子科技大学材料科学与工程学院,广西桂林541004
出 处:《桂林电子科技大学学报》2024年第6期579-584,共6页Journal of Guilin University of Electronic Technology
基 金:国家自然科学基金(11362005)。
摘 要:晶格玻尔兹曼方法(LBM)是一种新颖而有前途的计算流体力学方法,从算法的角度看,其迭代过程能被分化为多个子问题的并行程序,非常适合在高性能图像处理器(GPU)计算,获得极快的数据处理速度,同时有大量工作报告了基于GPU计算的LBM方法得到了高效实现。程序环境以C++编程语言,运用面向对象思想优化CUDA程序结构,可减少程序的耦合性,赋予程序的可持续发展能力;使用Poiseuille flow模型验证优化程序的稳定性与准确性。在程序运行过程中,调用CUDA内核函数来处理模型内的碰撞、迁徙流动、计算宏观量的迭代过程,同时使用共享内存储存GPU运行时的数据,以提高计算效率。数据分析结果表明,计算速度较中央处理器(CPU)提升了70倍,这归功于GPU高性能的并行计算能力。Lattice Boltzmann method(LBM)is a novel and promising computational fluid dynamics method,which has natural advantages.From the perspective of algorithm,the iterative process can be divided into parallel programs with multiple subproblems.In order to obtain extremely fast data processing speed,the iterative process is computed by high-performance graphics processing unit(GPU).At the same time,the efficient implementation of GPU-based LBM method has been widely reported,so it is very suitable for high performance image processor(GPU)calculation to obtain extremely fast data processing speed.The program environment is C++as the programming language,the CUDA program structure is optimized by object-oriented thinking,the coupling of the program is reduced,and the sustainable development of the program is endowed.Poiseuille flow model is used to verify the stability and accuracy of the optimization program.During the program running,CUDA kernel functions are called to deal with the collision within the model,migration flow and iterative process of calculating macro quantities.Meanwhile,shared memory is used to store GPU runtime data to improve computing efficiency.Analysis of the data show that computing speeds are up to 70 times faster than those of the central processing unit(CPU),thanks to the GPU's high-performance parallel computing capabilities.
关 键 词:晶格玻尔兹曼方法 面向对象 Poiseuille flow模型 CUDA
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28