检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:张慧明 ZHANG Huiming(VeriSilicon Microelectronics Shanghai Co.,Ltd,Shanghai 201203,China)
机构地区:[1]芯原微电子(上海)股份有限公司,上海201203
出 处:《集成电路应用》2020年第5期10-13,共4页Application of IC
基 金:上海市高新技术企业科技创新课题项目。
摘 要:分析表明,为了实现卷积神经网络的加速,经常通过并行排列多个卷积单元来实现。在理想情况下,卷积单元越多,处理速度越快。但是在实际应用中,数据带宽会大大限制卷积单元的处理速度,硬件的带宽资源非常珍贵,提高硬件的数据带宽代价巨大。因此,在有限的数据带宽和硬件开销下,提高卷积神经网络的处理速度,成为当前硬件架构设计急需解决的问题。鉴于以上所述现有技术的缺点,提供一种基于多核的卷积神经网络加速方法及系统、存储介质及终端,通过多个并行的卷积核节省卷积神经网络的数据带宽。在相同硬件数据带宽条件下,在卷积核中并行的向量点积运算提高卷积神经网络的处理速度。At present,in order to accelerate the convolution neural network,it is often realized by arranging several convolution units in parallel.In ideal case,the more convolution units,the faster processing speed.But in practical application,data bandwidth will greatly limit the processing speed of convolution unit,hardware bandwidth resources are very precious,and the cost of improving hardware data bandwidth is huge.Therefore,under the limited data bandwidth and hardware overhead,improving the processing speed of convolutional neural network becomes an urgent problem in the current hardware architecture design.In view of the disadvantages of the prior art,the invention aims to provide a multi-core based convolutional neural network acceleration method and system,storage medium and terminal,and save the data bandwidth of the convolutional neural network through multiple parallel convolution cores;under the same hardware data bandwidth,improve the convolutional neural network by parallel vector dot product operation in the convolution core processing speed.
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.22.70.233