基于注意力残差网络的人脸超分辨率重建  被引量:1

Face Super-Resolution Reconstruction Based on Attention Residual Network

在线阅读下载全文

作  者:王同官 赖惠成[1] 蔡玉玺 高古学 汪烈军 WANG Tongguan;LAI Huicheng;CAI Yuxi;GAO Guxue;WANG Liejun(College of Information Science and Engineering,Xinjiang University,Urumqi 830046,China)

机构地区:[1]新疆大学信息科学与工程学院,乌鲁木齐830046

出  处:《计算机工程》2023年第6期234-241,共8页Computer Engineering

基  金:国家自然科学基金(U1903213)。

摘  要:为解决通道内部特征信息交互性不足、特征利用和表示不够充分导致的人脸面部细节信息恢复不理想的问题,提出一种基于编码器-解码器的注意力残差网络,并设计基于注意力的残差模块,其主要由基准残差模块、沙漏模块与内部特征拆分注意力模块组成,通过内部特征拆分注意力模块加强通道内部之间的交互性,使网络能够提取到更详细的特征信息,恢复出更多人脸面部细节,同时在残差模块中利用一个预激活模块,解决批量归一化层在超分辨率网络中存在的伪影问题。在特征提取单元末端运用多阶特征融合模块充分融合多个阶段的特征,缓解特征在网络传输过程中的丢失现象,提高特征利用率。实验结果表明,该方法可以恢复出更多人脸面部细节,在Helen人脸数据集上,重建人脸图像的PSNR值为27.74 dB,相比SISN和DICNet方法,分别提高了1.47 dB、1.12 dB。在CelebA人脸数据集上,重建人脸图像的PSNR值为27.40 dB,相比SISN和DICNet方法,分别提高了1.26 dB、0.39 dB。Insufficient interactivity of feature information inside the channel and insufficient feature utilization and representation leads to less than ideal recovery of facial information.To address the problem,this paper proposes an attentional residual network based on encoder-decoder.In particular,it proposes a new Residual Attention Block(RAB),which mainly consists of a baseline residual block,an hourglass block,and an internal-feature split attention block used to strengthen the interactivity between channel interiors.This enables the network to extract more detailed feature information and recover more facial details.In addition,in the proposed residual block,a pre-activation block is used to solve the artifact problem that occurs in the Batch Normalization(BN)layer in super-resolution networks.Finally,a multi-stage feature fusion block is used at the end of the feature extraction unit to fully fuse the features obtained at different stages,which alleviates the feature loss during network transmission and improves the feature utilization.Experimental results show that the proposed method can recover more facial details.On the Helen face dataset,the PSNR value of face image reconstructed by this algorithm is 27.74 dB,which is 1.47 dB and 1.12 dB higher than that of SISN and DICNet methods,respectively.Similarly,on the CelebA face dataset,the PSNR value of face image reconstructed by the proposed algorithm is 27.40 dB,which is 1.26 dB and 0.39 dB higher than that of SISN and DICNet methods,respectively.

关 键 词:人脸超分辨率 注意力机制 残差网络 特征融合 编码器 解码器 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象