基于自注意力机制增强的深度学习图像压缩被引量：3

A Self-attention Mechanism Augmented Deep Learning Model for Images Compression

作　　者：展亚南施晓东[1] 孙镱诚丁阳杨万扣[2] ZHAN Ya-nan;SHI Xiao-dong;SUN Yi-cheng;DING Yang;YANG Wan-kou(The 28th Research Institute of China Electronics Technology Group Corporation,Nanjing 210007,China;School of Automation,Southeast University,Nanjing 211189,China)

机构地区：[1]中国电子科技集团公司第二十八研究所,江苏南京210007 [2]东南大学自动化学院,江苏南京211189

出　　处：《控制工程》2022年第3期536-541,共6页Control Engineering of China

基　　金：装备预先研究项目(301021302)。

摘　　要：提出了一种基于自注意力机制增强的深度学习模型,用于无人机侦察图像的压缩与解压。与现有方法相比,提出的深度学习模型有两个显著特点:其一,模型由四部分组成(编码器、二值化器、量化器和解码器),并且可以通过端到端的优化提高模型的压缩和解压效率;其二,量化器是基于自注意力机制增强的多层前馈神经网络,它能充分利用图像的上下文信息对图像进行压缩。在公开数据集Kodak和Tecnick的实验结果表明,提出模型的压缩率-保真率曲线优于传统的图像压缩标准和现有的深度学习模型。对于常规大小的图像,在保持图像质量MS-SSIM为85%~95%的前提下,图像压缩比BPP能达到7%~15%,并且在普通CPU上其处理速度达0.48秒/张,能显著降低影像的数据大小且不牺牲处理速度。A self-attention mechanism augmented deep learning model is proposed to compress and decompress the UAV reconnaissance image in this paper.Compared with the existing methods,the proposed deep learning model in this paper has two significant characteristics.Firstly,the model consists of four parts(encoder,binarizer,quantizer and decoder),and the compression and decompression efficiency of the model can be improved through end-to-end optimization.Secondly,the quantizer is a self-attention mechanism augmented multi-layer feedforward neural network,which can make full use of the context information to compress the image.Experimental results on public data sets such as Kodak and Tecnick show that the Bits Per Pixel-Peak Signalto Noise Ratial(BPP-PSNR)curve of the proposed model is better than that of the traditional image compression standards and existing deep learning models.For images with commonly used size,the compression ratio of the model,e.g.BPP,can reach 7%~15%while maintaining the MS-SSIM of 85%~95%,and the processing speed can reach 0.48 s/sheet on the ordinary CPU.The model proposed in this paper can significantly reduce the data size of the compressed image without sacrificing the processing speed.

关键词：图像压缩深度学习自注意力机制端到端多层前馈神经网络

分类号：TP18[自动化与计算机技术—控制理论与控制工程]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于自注意力机制增强的深度学习图像压缩被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于自注意力机制增强的深度学习图像压缩 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于自注意力机制增强的深度学习图像压缩被引量：3