检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:李宁 肖昊 Li Ning;Xiao Hao(School of Microelectronics,Hefei University of Technology,Hefei 230601,China)
出 处:《电子测量技术》2024年第5期1-8,共8页Electronic Measurement Technology
基 金:国家自然科学基金(61974039)项目资助。
摘 要:剪枝是一种减少卷积神经网络权重和计算量的有效方法,为CNN的高效部署提供了解决方案。但是,剪枝后的稀疏CNN中权重的不规则分布使硬件计算单元之间的计算负载各不相同,降低了硬件的计算效率。文章提出一种细粒度的CNN模型剪枝方法,该方法根据硬件加速器的架构将整体权重分成若干个局部权重组,并分别对每一组局部权重进行独立剪枝,得到的稀疏CNN在加速器上实现了计算负载平衡。此外,设计一种具有高效PE结构和稀疏度可配置的稀疏CNN加速器并在FPGA上实现,该加速器的高效PE结构提升了乘法器的吞吐率,同时可配置性使其可灵活地适应不同稀疏度的CNN计算。实验结果表明,提出的剪枝算法可将CNN的权重参数减少50%~70%,同时精度损失不到3%。相比于密集型加速器,提出的加速器最高可实现3.65倍的加速比;与其他的稀疏型加速器研究相比,本研究的加速器在硬件效率上提升28%~167%。Pruning is an effective approach to reduce weight and computation of convolutional neural network,which provides a solution for the efficient implementation of CNN.However,the irregular distribution of weight in the pruned sparse CNN also makes the workloads among the hardware computing units different,which reduces the computing efficiency of the hardware.In this paper,a fine-grained CNN model pruning method is proposed,which divides the overall weight into several local weight groups according to the architecture of the hardware accelerator.Then each group of local weights is pruned independently respectively,and the sparse CNN obtained is workload-balancing on the accelerator.Furthermore,a sparse CNN accelerator with efficient PE and configurable sparsity is designed and implemented on FPGA.The efficient PE improves the throughput of the multiplier,and the configurability makes it flexible to compute CNN with different sparsity.Experimental results show that the presented pruning algorithm can reduce the weight parameters of CNN by 70%and the accuracy loss is less than 3%.Compared to dense accelerator research,the accelerator proposed in this paper achieves up to 3.65x speedup.The accelerator improves the hardware efficiency by 28~167%compared with other sparse accelerators.
分 类 号:TP302[自动化与计算机技术—计算机系统结构]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222