检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:肖国麟 杨春玲[1] 陈宇[1] Xiao Guolin;Yang Chunling;Chen Yu(School of Electrical Engineering and Automation,Harbin Institute of Technology,Harbin 150001,China)
机构地区:[1]哈尔滨工业大学电气工程及自动化学院,黑龙江哈尔滨150001
出 处:《电子技术应用》2020年第10期39-41,共3页Application of Electronic Technique
摘 要:传统的卷积神经网络量化算法广泛使用对称均匀量化操作对模型权值进行量化,没有考虑到相邻权值量化之间的相互关系,即上一个权值的量化操作产生的量化噪声可以通过调整之后权值的量化方向加以弥补。针对上述问题,提出了一种基于权值交互思想的三值卷积神经网络量化算法,达到了16倍的模型压缩比,以ImageNet作为数据集,量化后的AlexNet和ResNet-18网络上模型预测准确率只下降了不到3%。该方法达到了较高的模型压缩比,具有较高的精度,可以用于将卷积神经网络移植到计算资源有限的移动端平台上。Traditional convolutional neural network quantization algorithms widely use symmetric uniform quantization operations to quantize models ′ weights, without taking into account the correlation between the quantization of adjacent weights, that is, the quan-tization noise generated by the quantization operation of the previous weight can be made up after adjusting the quantitative direc-tion of the next weights. Aiming at the above problems, a ternary convolutional neural network quantization algorithm based on the idea of weight interaction is proposed, the model compression ratio is 16 times. On the ImageNet dataset, the model prediction ac-curacy of ternarized AlexNet and ResNet-18 network only decrease less than 3 %. This method achieves a high model compression ratio, has higher accuracy, and can be used to transplant convolutional neural networks to mobile platforms with limited computing resources.
分 类 号:TN911.73[电子电信—通信与信息系统] TP391[电子电信—信息与通信工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28