检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:朱凯 李理[1,2] 张彤 江晟 别一鸣 ZHU Kai;LI Li;ZHANG Tong;JIANG Sheng;BIE Yiming(School of Physics,Changchun University of Science and Technology,Changchun 130022,Jilin,China;School of Electronical and Information Engineering,Changchun University of Science and Technology,Changchun 130022,Jilin,China;School of Transportation,Jilin University,Changchun 130022,Jilin,China)
机构地区:[1]长春理工大学物理学院,吉林长春130022 [2]长春理工大学电子信息工程学院,吉林长春130022 [3]吉林大学交通学院,吉林长春130022
出 处:《计算机工程》2024年第9期276-285,共10页Computer Engineering
基 金:吉林省科技发展计划重点研发项目(20210203214SF)。
摘 要:运动模糊是导致图像退化的常见原因,其限制了图像的可读性和后续处理效果。针对卷积网络感受野有限以及常规多阶段网络中信息丢失的问题,提出一种基于Transformer的多阶段去模糊网络。网络采用多阶段编码器-解码器结构,在单个阶段内和多个阶段间采用跳跃连接来增强信息的传递。首先,高效Transformer模块采用通道注意力和深度卷积来处理图像的全局和局部信息;其次,多分支结构的前馈传播网络通过引入多个并行的分支,实现了不同尺度和不同层次的特征提取和融合;最后,通过多阶段的残差处理实现更优的图像恢复结果。实验结果显示,在GoPro数据集上该网络的峰值信噪比(PSNR)达到32.23 dB,结构相似性指数(SSIM)达到0.955,在HIDE数据集上PSNR和SSIM分别达到30.15 dB和0.930,优于DeepDeblur、DeblurGAN-V2等模型。Motion blur is a common cause of image degradation that limits image readability and subsequent processing.A multi-stage deblurring network based on the Transformer is proposed to address the limited receptive field of convolutional networks and information loss in conventional multi-stage networks.The network adopts a multi-stage encoder-decoder structure with skip connections within and between stages to enhance information propagation.First,an efficient Transformer module is used to process the global and local information of the image using channel attention and depthwise convolution.Second,a multi-branch feedforward network with multiple parallel branches is introduced to extract and fuse features at different scales and levels.Finally,superior image restoration results are achieved through multi-stage residual learning.Experimental results show that the proposed method achieves a Peak Signal-to-Noise Ratio(PSNR)of 32.23 dB and Structural Similarity Index Measure(SSIM)of 0.955 on the GoPro dataset,and a PSNR of 30.15 dB and SSIM of 0.930 on the HIDE dataset,demonstrating a performance superior to DeepDeblur,DeblurGAN-V2,and other models.
关 键 词:深度学习 Transformer模型 注意力机制 图像修复 多尺度网络
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.141.6.24