检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:常龙鑫 虞致国 钟啸宇 顾晓峰 CHANG Longxin;YU Zhiguo;ZHONG Xiaoyu;GU Xiaofeng(Engineering Research Centre of IoT Technology Applications(Ministry of Education),Jiangnan University,Wuxi Jiangsu 214122,China)
机构地区:[1]江南大学物联网技术应用教育部工程研究中心,江苏无锡214122
出 处:《电子器件》2022年第6期1289-1295,共7页Chinese Journal of Electron Devices
基 金:江苏省重点研发计划(BE2019003-2);中央高校基本科研业务费专项资金资助(JUSRP51510)。
摘 要:浮点单元是高性能处理器的速度瓶颈之一,基于广泛应用的开源RISC-V浮点单元原型,设计了一种面向RISC-V处理器的高速浮点单元。对该原型中时序最差的浮点融合乘加、整数转浮点、除法开方子模块分别进行静态时序分析,并定位其中需要优化的关键模块。针对该浮点单元原型中存在的问题,提出基于算法优化和流水线优化的设计思路,设计基4 Booth-Wallace乘法模块替代原有多位宽乘法模块,设计基于二叉树的并行前导零检测模块替代原有串行前导零检测模块,增加了部分子模块的流水线级数。基于SMIC 55 nm工艺对优化设计前后的RISC-V浮点单元原型进行了性能评估,优化后的工作频率达到820 MHz,提升了39.46%,而面积开销增加了15.14%。Floating-point unit is one of the speed bottlenecks of high-performance processor.Based on the widely used open-source RISC-V floating-point unit prototype,a high-speed floating-point unit for RISC-V processor is designed.Static timing analyses are performed for the submodules which have worst timing in prototype,including the floating-point fused multiply-add,integer-to-floating-point,floating-point divider and sqrt submodules.Next,the critical modules most in need of optimization are located.Then,the design methods are proposed based on algorithm optimization and pipeline optimization.A radix-4 Booth-Wallace multiplication module is designed to replace the original multi-bit wide multiplication algorithm.A parallel leading zero detection algorithm based on binary tree is designed to replace the original serial leading zero detection algorithm,and the pipeline stages of some submodules are increased.Based on the SMIC 55 nm technology,the performance of RISC-V floating-point unit prototype before and after the optimization are evaluated.The operating frequency reaches 820 MHz after optimization,increased by 39.46%,and the area is increased by 15.14%.
关 键 词:RISC-V 浮点单元 基4 Booth-Wallace乘法 并行前导零检测
分 类 号:TP302.2[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.135.184.166