检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:金洲 段懿洳 伊恩鑫 戢昊男 刘伟峰[1] JIN Zhou;DUAN Yiru;YI Enxin;JI Haonan;LIU Weifeng(College of Information Science and Engineering,China University of Petroleum,Beijing 102249,China)
机构地区:[1]中国石油大学(北京)信息科学与工程学院,北京102249
出 处:《国防科技大学学报》2022年第5期80-91,共12页Journal of National University of Defense Technology
基 金:国家自然科学基金资助项目(61972415);计算机体系结构国家重点实验室开放课题资助项目(CARCHA202115)。
摘 要:规约与扫描是并行计算中的核心原语,其并行加速至关重要。然而,冯·诺依曼体系结构下无法避免的数据移动使其面临“存储墙”等性能与功耗瓶颈。近来,基于ReRAM等非易失存储器的存算一体架构支持的原位计算可一步实现矩阵-向量乘,已在机器学习与图计算等应用中展现了巨大的潜力。提出面向忆阻器存算一体架构的规约与扫描的并行加速方法,重点阐述基于矩阵-向量乘运算的计算流程和在忆阻器架构上的映射方法,实现软硬件协同设计,降低功耗并提高性能。相比于GPU,所提规约与扫描原语可实现高达两个数量级的加速,平均加速比也可达到两个数量级。分段规约与扫描最大可达到五个(平均四个)数量级的加速,并将功耗降低79%。Reduction and scan are two critical primitives in parallel computing.Thus,accelerating reduction and scan shows great importance.However,the Von Neumann architecture suffers from performance and energy bottlenecks known as“memory wall”due to the unavoidable data migration.Recently,NVM(non-volatile memory)such as ReRAM(resistive random access memory),enables in-situ computing without data movement and its crossbar architecture can perform parallel GEMV(matrix-vector multiplication)operation naturally in one step.ReRAM-based architecture has demonstrated great success in many areas,e.g.accelerating machine learning and graph computing applications,etc.Parallel acceleration methods were proposed for reduction and scan primitives on ReRAM-based PIM(processing in memory)architecture,the computing process in terms of GEMV and the mapping method on the ReRAM crossbar were focused,and the co-design of software and hardware was realized to reduce power consumption and improve performance.Compared with GPU,the proposed reduction and scan algorithm achieved substantial speedup by two orders of magnitude,and the average acceleration ratio can also reach two orders of magnitude.The case of segmentation can achieve up to five(four on average)orders of magnitude.Meanwhile,the power consumption decreased by 79%.
分 类 号:TN95[电子电信—信号与信息处理]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.117.114.211