GPU集群加速近似逆预条件CG并行求解器  被引量:1

Approximate Inverse Preconditioned CG Parallel Solver on GPU Cluster

在线阅读下载全文

作  者:赵莲[1,2] 赵永华[1] 陈尧[1,2] 赵慰[1,2] 

机构地区:[1]中国科学院计算机网络信息中心,北京100190 [2]中国科学院大学,北京100190

出  处:《计算机科学与探索》2015年第9期1084-1092,共9页Journal of Frontiers of Computer Science and Technology

基  金:国家重点基础研究发展计划(973计划)No.2011CB309702;数学工程与先进计算国家重点实验室开放基金No.2014A03~~

摘  要:针对GPU集群系统,研究了分解近似逆(approximate inverse,AINV)和对称逐次超松弛-近似逆(symmetric successive over relaxation approximate inverse,SSOR-AI)两类近似逆预条件的并行算法。采用多级k-路图划分方法,通过子图的内点和边界点识别方法以及稀疏矩阵的置换技术,提出了将稀疏矩阵转换为分块箭形矩阵的并行方法。基于所形成的分块箭形矩阵,结合块内稀疏矩阵近似逆串行、块间并行的策略给出了近似逆预条件的并行方法,实现了AINV和SSOR-AI并行算法,解决了AINV预条件难以并行的问题。基于CPU与GPU协同计算、主机端页锁定内存和设备端计算与通信重叠的优化技术,实现了并行近似逆预条件与共轭梯度(conjugate gradient,CG)算法相结合的线性方程组混合并行求解器。数值实验表明,所提方法对AINV和SSOR-AI两类近似逆预条件,在多GPU上获得了很好的可扩展性和加速效果。This paper shows the study on the parallel algorithm of AINV(approximate inverse) and SSOR-AI(symmetric successive over relaxation approximate inverse) preconditioners on GPU cluster systems. With multilevel kway graph partitioning, this paper proposes the parallel method which can transform a sparse matrix into block arrow form based on a method to identify interior/boundary vertex of subgraphs and a permutation. Based on the block arrow matrix, with the strategy of sequential computation approximate inverse of inner block and parallel computation between the different blocks, the parallel algorithm of AINV and SSOR- AI is obtained. Based on the optimization techniques of collaborative computing between CPU and GPU, page- locked host memory and overlapping transfers with computation on device, this paper combines parallel approximate inverse preconditioner with CG(conjugate gradient) algorithm to obtain a hybrid parallel solver for linear systems. Numerical experiments indicate that applying the above methods can obtain very good acceleration effect and scalability both AINV parallel implementation and SSOR-AI parallel implementation on cluster-GPU.

关 键 词:近似逆 预条件 迭代法 异构并行计算 GPU集群 

分 类 号:TP302[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象