检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:陈正博 吴铁彬 郑方[1] 丁亚军[1] CHEN Zheng-bo;WU Tie-bin;ZHENG Fang;DING Ya-jun(Jiangnan Institute of Computing Technology,Wuxi 214000,China)
机构地区:[1]江南计算技术研究所
出 处:《计算机技术与发展》2019年第8期96-101,共6页Computer Technology and Development
基 金:国家重点研发计划项目(2016YFB0200501)
摘 要:近年来,面向人工智能领域的芯片快速发展,低精度和混合精度的乘加运算能力是人工智能芯片计算能力的核心指标,同时乘加部件也是人工智能芯片功率的主要消费者。面向人工智能领域应用需求,研究高性能、低能耗、低开销的浮点乘加器,对人工智能芯片的研发具有重要意义。文中设计了一种面向AI的浮点乘加器,支持单精度、半精度、单半混合精度的浮点乘加运算,也支持32位、16位和8位的整数乘法运算。该部件采用跨精度复用的设计思想,提出乘法器复用、移位器复用、前导零预测器复用等关键技术,在保证各类操作功能和性能的基础上,有效减少了芯片面积和功耗。文中完成了该部件的正确性测试和物理综合。实验结果表明,该部件能满足正确性要求,在28nm工艺条件下,对比无复用设计至少减少50.09%的面积和47.91%的功耗,综合运行频率达到2GHz。In recent years,chips in the field of artificial intelligence have been developing rapidly.The multiply-add operation with low precision and mixed precision is the core index of the computing power of artificial intelligence chips,and the multiply-add components are also the main consumers of the power of artificial intelligence chips.It is of great significance for the research and development of artificial intelligence chips to study floating point multiplicator with high performance,low energy consumption and low overhead for the application demand in the field of artificial intelligence.In this paper,an AI-oriented floating point multiplicator is designed,which supports the floating point multiplication and addition operations with single precision,half precision and single-half-mixed precision,as well as the integer multiplication operations with 32,16 and 8 bits.This design adopts the idea of cross-precision reuse and reduces the area and power of chips with ensuring all kind of operation functions and performance,by using proposed key technology like multiplier-reuse,shifter-reuse,LZA-reuse.We finish the correctness test and physical syntheses.The experiment shows that this architecture is correct in all operations.Compared with the no-reuse design,at least 50.09%of the area and 47.91%of the power consumption can be reduced under the condition of 28 nm process.The comprehensive operating frequency can reach 2 GHz.
关 键 词:人工智能 浮点乘加器 单精度 半精度 单半混合精度
分 类 号:TP302[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.15