Authors: YANG Zi-Jiang (杨子江), ZHANG Ke-Long (张克龙), LIU Qian (刘倩) [1,2], XU Shun (徐顺), SUN Peng (孙鹏) [3]
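Affiliations: [1] Computer Network Information Center, Chinese Academy of Sciences, Beijing 100190, China; [2] University of Chinese Academy of Sciences, Beijing 100049, China; [3] Nanjing Normal University, Nanjing 210023, China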
Source: Computer Systems & Applications (《计算机系统应用》), 2022, No. 11, pp. 358-364 (7 pages)
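Funding: Chinese Academy of Sciences Strategic Priority Research Program (Category B) cultivation project (XDPB25); Hygon Industry Ecosystem Cooperation Organization Fund (ghfund202107011598)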
Abstract: Lattice quantum chromodynamics (Lattice QCD) is an important theory and method for studying the interactions between microscopic particles such as quarks and gluons. By discretizing spacetime into a four-dimensional structured grid and defining the basic field quantities of QCD on that grid, researchers can use numerical simulation to study hadron interactions and properties from first principles. The computational cost of this process is very high, however, and requires large-scale parallel computing. The core of a Lattice QCD computation is the Lattice QCD solver, which is the main computational hot spot of the program. This work studies the implementation and optimization of a Lattice QCD solver on a domestic heterogeneous computing platform and proposes a solver design. It implements a BiCGSTAB solver, which significantly reduces the number of iterations; applies even-odd preconditioning, which reduces the size of the problem to be solved; and optimizes the memory access of the Dslash module for the characteristics of the domestic heterogeneous accelerator. Experimental tests show that the optimized solver achieves a speedup of about 30 times over the solver before optimization, which provides a useful reference for the performance optimization of Lattice QCD software on domestic heterogeneous supercomputers.
Keywords: lattice quantum chromodynamics; equation solver; parallel computing; heterogeneous computing
Classification code: O572.243 [Science: Particle Physics and Nuclear Physics]