无网格Galerkin法GPU加速并行计算及其应用被引量：1

Parallel computing and application of Element-Free Galerkin method for GPU acceleration

出　　处：《计算力学学报》2015年第6期745-751,共7页Chinese Journal of Computational Mechanics

基　　金：国家自然科学基金(51375417;51405415)资助项目

摘　　要：针对无网格Galerkin法计算耗时的问题,采用逐节点对法来组装刚度矩阵、共轭梯度法求解基于CSR格式存储的稀疏线性方程组,提出了一种利用罚函数法施加本质边界条件的EFG法GPU加速并行算法,给出了刚度矩阵和惩罚刚度矩阵的统一格式,以及GPU加速并行算法的流程图。编写了基于CUDA构架平台的GPU程序,且在NVIDIA GeForce GTX 660显卡上通过数值算例对所提算法进行了性能测试与分析比较,探讨了影响加速比的因素。算例结果验证了所提算法的可行性,并在满足计算精度的前提下,其加速比最大可达17倍;同时线性方程组的求解对加速比起决定性影响。In order to reduce the computing time of Element-Free Galerkin（EFG） method,a GPU acceleration parallel algorithm of EFG method that essential boundary condition is imposed by penalty function method is proposed, in which stiffness matrix is assembled by node pair-wise approach ,and sparse linear equations based on CSR format is solved by conjugate gradient methods. The unified format of stiffness matrix and penalty stiffness matrix was derived, and the flow chart of the parallel algorithm was provided. The GPU codes were programmed on CUDA,and algorithm testing was finished on the device of NVIDIA GeForce GTX 660 by numerical examples. The factors of affecting speedup ratio were discussed. The example results verified the feasibility of the proposed algorithm. The maximum speedup ratio was up to 17 times on the premise that the calculating accuracy is met,and to solve linear equations is the major factor in the speedup.

关键词：无网格GALERKIN法 GPU加速并行计算 CUDA

分类号：TH123[机械工程—机械设计及理论] O241.82[理学—计算数学]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

无网格Galerkin法GPU加速并行计算及其应用被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

无网格Galerkin法GPU加速并行计算及其应用 被引量：1

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

无网格Galerkin法GPU加速并行计算及其应用被引量：1