专用指令集在基于FPGA的神经网络加速器中的应用  被引量:5

Application of Special Command Set in Neural Network Accelerator Based on FPGA

在线阅读下载全文

作  者:胡航天 刘凯[1] 马士超 郭子博 HU Hangtian;LIU Kai;MA Shichao;GUO Zibo(School of Computer Science and Technology,XIDIAN University,Xi’an 710071,China)

机构地区:[1]西安电子科技大学计算机科学与技术学院,西安710071

出  处:《空间控制技术与应用》2020年第3期36-41,54,共7页Aerospace Control and Application

基  金:国家自然科学基金资助项目(61850410523)。

摘  要:近年来,表现出极其优越性能的神经网络算法对硬件算力的要求逐渐提高.在一些低功耗场景如星载系统中,拥有可编程重构、高并行等特性的FPGA是神经网络算法较为合适的硬件加速平台.为了解决传统神经网络硬件加速器设计中片内资源消耗大、各功能模块耦合性高等问题,设计实现了一套专用AI指令集并应用在了基于FPGA的神经网络加速器的设计中.文章首先介绍了该指令集的设计方案.整个指令集由指令寄存器、指令解释器、指令转发模块、内存管理单元和多个模块构成.通过该指令集可实现对不同模块的复用,降低模块之间的耦合性.并以YOLOV3-Tiny网络模型为例,对比了平铺式和指令控制式两种加速方案的逻辑资源的消耗.验证了应用专用指令集可以减少约50%的FPGA逻辑资源的使用.In recent years,the requirements of hardware computing power for neural network algorithms that showing extremely superior performance have gradually become higher.In some low-power scenarios such as spaceborne systems,FPGAs with low power consumption and high parallelism are the most suitable hardware acceleration platforms for neural network algorithms.In order to solve the problems of high on-chip resource consumption and high coupling of various operation modules in hardware structure design,a set of dedicated instruction set is designed and implemented to the structural design of FPGA-based neural network accelerator.Firstly,the design and application of the instruction set are introduced.The whole system is composed of instruction register,instruction interpreter,instruction forwarding module,memory management unit,and multiple execution modules.The system can realize the multiplexing of different operation modules and reduce the coupling between modules.Afterwards,the YOLOV3-Tiny network model is used as an example to compare the on-chip resource consumption of two acceleration schemes,tiled and command-controlled.It is verified that the application of dedicated instruction set can effectively reduce the use of FPGA on-chip resources.

关 键 词:指令集 神经网络 FPGA 

分 类 号:TP302.1[自动化与计算机技术—计算机系统结构]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象