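Parallel finite element algorithm for structural transient analysis on the Sunway (Shenwei) heterogeneous many-core processor architecture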

Parallel algorithms for structure transient analysis based on heterogeneous multi-core processor architecture


Authors: YU Gaoyuan; LOU Yunfeng [1,2,3]; LI Junjie; JIN Xianlong

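Affiliations: [1] State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao Tong University, Shanghai 200240, China; [2] School of Mechanical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China; [3] Aerospace System Engineering Shanghai, Shanghai 201108, China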

Source: Journal of Vibration and Shock (《振动与冲击》), 2023, No. 6, pp. 152-158 (7 pages)

Funding: National Natural Science Foundation of China (11772192).

Abstract: Based on the architectural characteristics of the domestic Sunway (Shenwei) heterogeneous many-core distributed-memory computer, a hierarchical parallel computing method for structural transient finite element analysis is proposed. The method is significant for improving the efficiency of parallel transient solutions for large and very large complex structural systems on such machines. Building on hierarchical communication and the Newmark-HHT algorithm, it establishes a parallel transient solution framework for large-scale complex structural systems. The framework stores the large volume of data produced during the computation in a distributed manner, which significantly improves memory access efficiency, and it parallelizes the computation at two levels, which effectively improves communication efficiency. The method can therefore exploit the architecture of the domestic Sunway heterogeneous many-core distributed-memory parallel computer to raise the efficiency of large-scale parallel structural transient analysis. Finally, typical numerical examples verify the correctness and effectiveness of the method, and it is applied to a high-rise building, achieving a parallel structural transient computation with more than ten million degrees of freedom on tens of thousands of cores.
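The abstract names the Newmark-HHT algorithm as the time-integration basis but does not give its equations. As a rough illustration only, the following is a minimal serial sketch of the textbook HHT-alpha scheme for a single-degree-of-freedom linear system. It is not the authors' parallel implementation. The function name `hht_alpha_sdof`, its parameters, and the demo values are assumptions introduced here, and the parameter choices beta = (1 - alpha)^2 / 4 and gamma = 1/2 - alpha are the standard ones from the literature rather than values taken from the paper.

```python
import math


def hht_alpha_sdof(m, c, k, force, u0, v0, dt, n_steps, alpha=-0.05):
    """Integrate m*u'' + c*u' + k*u = force(t) with the HHT-alpha scheme.

    Hypothetical helper for illustration; alpha = 0 recovers the
    average-acceleration Newmark method.
    """
    if not -1.0 / 3.0 <= alpha <= 0.0:
        raise ValueError("alpha must lie in [-1/3, 0]")
    beta = (1.0 - alpha) ** 2 / 4.0
    gamma = 0.5 - alpha

    u, v = u0, v0
    # Initial acceleration from the equation of motion at t = 0.
    a = (force(0.0) - c * v - k * u) / m
    # Effective mass; constant because the system is linear and dt is fixed.
    m_eff = m + (1.0 + alpha) * (gamma * dt * c + beta * dt * dt * k)

    history = [(0.0, u, v, a)]
    for n in range(n_steps):
        t_old, t_new = n * dt, (n + 1) * dt
        # Newmark predictors (the parts that do not depend on a_new).
        u_pred = u + dt * v + dt * dt * (0.5 - beta) * a
        v_pred = v + dt * (1.0 - gamma) * a
        # HHT-alpha balance: internal and external forces are weighted
        # between the old and new time levels.
        rhs = ((1.0 + alpha) * force(t_new) - alpha * force(t_old)
               - (1.0 + alpha) * (c * v_pred + k * u_pred)
               + alpha * (c * v + k * u))
        a_new = rhs / m_eff
        # Newmark correctors.
        u = u_pred + beta * dt * dt * a_new
        v = v_pred + gamma * dt * a_new
        a = a_new
        history.append((t_new, u, v, a))
    return history


if __name__ == "__main__":
    # Undamped oscillator with a 1 s natural period, released from u = 1.
    stiffness = (2.0 * math.pi) ** 2
    result = hht_alpha_sdof(1.0, 0.0, stiffness, lambda t: 0.0,
                            1.0, 0.0, 0.001, 1000)
    print(result[-1])
```

In a finite element setting the scalars `m`, `c`, and `k` would become the global mass, damping, and stiffness matrices, and the division by `m_eff` would become a linear solve at every time step. That solve is presumably the step a distributed, two-level parallel method such as the one described in the abstract targets.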

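Keywords: heterogeneous many-core; distributed storage; hierarchical communication; large-scale transient analysis; parallel computing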

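Classification: TP301.6 [Automation and Computer Technology: Computer System Architecture]; O241.6 [Automation and Computer Technology: Computer Science and Technology]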

 
