检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张元胤 肖敏广 刘志勇[1] 翁灵玲 陈志广 卢宇彤[1] ZHANG Yuanyin;XIAO Minguang;LIU Zhiyong;WENG Lingling;CHEN Zhiguang;LU Yutong(School of Computer Science and Engineering,Sun Yat-sen University,Guangzhou 511400,China)
出 处:《计算机工程与科学》2025年第2期200-209,共10页Computer Engineering & Science
基 金:国家重点研发计划(2021YFB0300103);国家自然科学基金(62272499);广东特支计划(2021TQ06X160)。
摘 要:MT-3000是由国防科技大学面向下一代超级计算机设计的国产异构众核处理器,具有优越的计算能力,可以有效加速可视化数据处理。等值线和等值面提取是标量场数据最常用的几何可视化方法,但现有的提取算法通常仅面向通用CPU或GPU。在MT-3000处理器上,由于片上缓存空间有限,从核访存带宽限制等问题,导致计算效率低下;另外,由于编程模型的特殊性,现有软件与方法无法直接在MT-3000上运行。为了充分发挥国产超算系统在可视化领域的计算效能,基于MT-3000的微体系结构对等值线网格序列算法和等值面移动立方体算法分别提出了新的并行化算法。新方法采用向量指令、流水线实现存算重叠等技术,更加适应异构众核架构,从而达到加速算法执行的目的。实验结果表明,2种算法的加速比均达到4以上,并且随着从核的增多,算法的执行时间近呈线性下降,这证明所提算法具有良好的可扩展性。The MT-3000 is a domestic heterogeneous many-core processor designed by the National University of Defense Technology for the next generation of supercomputers.It has superior computing power and can effectively accelerate data processing in visualization.Isoline and isosurface extraction is the most common geometric visualization method for scalar field data.However,existing extraction algorithms typically target general CPUs or GPUs.On MT-3000 processors,the computing efficiency is low due to the limited cache space on-chip,bandwidth throttling of memory access from the cores,etc.In addition,due to the unique nature of programming models,existing software and methods are unable to run on MT-3000 processors directly.In order to fully utilize the computational efficiency of the domestic supercomputing systems in the field of visualization,this paper implements a new parallelization algorithm of the grid scan algorithm for isoline extraction and the marching cubes algorithm for isosurface extraction based on the hardware characteristics of MT-3000.Techniques such as vector instructions and pipeline implementation are used to better adapt to the many-core architecture,thus achieving the goal of improving performance.The experimental results show a speedup of over 4,and the execution time of both the algorithms decreases nearly linearly while increasing cores,which proves the scalability of the algorithms.
关 键 词:数据过滤 等值线 等值面 并行计算 异构 众核 国产超算系统
分 类 号:TP393[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.148.200.110