检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:杨智杰 王蕾[1] 石伟[1] 彭凌辉 王耀[1] 徐炜遐[1] Yang Zhijie;Wang Lei;Shi Wei;Peng Linghui;Wang Yao;Xu Weixia(College of Computer Science and Technology,National University of Defense Technology,Changsha 410073)
出 处:《计算机研究与发展》2023年第1期17-29,共13页Journal of Computer Research and Development
基 金:国家重点研发计划项目(2018YFB2202603,2020AAA0104602)。
摘 要:类脑处理器较深度学习处理器具有能效优势.类脑处理器的片上互连一般采用具有可扩展性高、吞吐量高和通用性高等特点的片上网络.为了解决采用同步片上网络面临的全局时钟树时序难以收敛的问题以及采用异步片上网络面临的链路延迟匹配、缺乏电子设计自动化工具实现和验证的问题,提出了一种异步片上网络架构——NosralC,用于构建全局异步局部同步(global asynchronous local synchronous,GALS)的多核类脑处理器.NosralC采用异步链路和同步路由器实现.实验表明,NosralC较同步基线,在4个类脑应用数据集下展现出37.5%~38.9%的功耗降低、5.5%~8.0%的平均延迟降低和36.7%~47.6%的能效提升,同时增加不多于6%的额外资源以及带来较小的性能开销(吞吐量降低0.8%~2.4%).NosralC在现场可编程门阵列(FPGA)上得到了验证,证明了该架构的可实现性.Neuromorphic processors show extremely high energy efficiency advantages over traditional deep learning processors.The network-on-chip with high scalability,high throughput,and high versatility features is generally adopted as the on-chip communication and connection implementation of neuromorphic processors.In order to solve the problems of making the synchronous network-on-chip that adopts the global clock tree to achieve timing closure,matching link delay in the asynchronous network-on-chip,and lacking electronic design automation tools in implementation and verification of asynchronous network-on-chip,we propose a low-power asynchronous network-onchip architecture,NosralC,to build a global-asynchronous-local-synchronous multi-core neuromorphic processor.NosralC is implemented with asynchronous links and synchronous routers.The small amount of asynchronous design makes NosralC similar to the synchronous design and friendly to implementation and validation of asynchronous design using existing electronic design automation tools.Experiments show that compared with a synchronous counterpart baseline with the same function,NosralC achieves 37.5%-38.9%reduction in power consumption,5.5%-8.0%reduction in average latency,and 36.9%-47.6%improvement in energy efficiency in executing the FSDD,DVS128 Gesture,NTI-DIGITS,and NMNIST neuromorphic application datasets while increasing less than6%additional resource overhead and a small amount of performance overhead(0.8%-2.4%throughput decrease).NosralC is verified on the field programmable gate array(FPGA)platform and its implementability is proved.
关 键 词:类脑处理器 片上网络 异步电路 全局异步局部同步 脉冲神经网络
分 类 号:TP389.1[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.185