多属性无监督人脸风格翻译  

Multi-attribute Unsupervised Face Style Translation

在线阅读下载全文

作  者:朱剑锋 郑熠 廖聪慧 李孝杰 梁梦娇 ZHU Jian-Feng;ZHENG Yi;LIAO Cong-Hui;LI Xiao-Jie;LIANG Meng-Jiao(School of Computer Science,Chengdu University of Information Technology,Chengdu 610225,China;College of Communication Engineering,Chengdu University of Information Technology,Chengdu 610225,China)

机构地区:[1]成都信息工程大学计算机学院,成都610225 [2]成都信息工程大学通信工程学院,成都610225

出  处:《计算机系统应用》2023年第6期12-21,共10页Computer Systems & Applications

基  金:四川省科技厅重点研发计划(2021YFQ0053,2022YFG0152);四川省科技成果转移转化示范项目(2023ZHCG0018);四川省高等教育人才培养质量和教学改革项目(JG2021-1015);成都信息工程大学本科教育教学研究与改革项目暨本科教学工程(JYJG2022131)。

摘  要:针对现有人脸图像翻译模型不能实现多个视觉属性之间的翻译及翻译后的人脸图像不清晰自然的问题,提出了基于人脸识别方法的人脸多属性图像翻译模型.模型主要由内容和风格编码器、AdaIN解码器以及人脸识别模块构成.首先,两个编码器提取内容和风格图像的潜在编码,然后将编码送入到AdaIN层中仿射变换,最后解码器还原翻译后的图像.该方法设计并训练了一个准确率90.282%的人脸识别模型并提出了一种联合人脸属性损失函数,增强了模型对风格人脸的属性的关注程度,解决了模型不能准确提取到人脸的属性信息以及摒弃了无关信息,使得模型能够生成清晰的、多属性的,多样的人脸翻译图像.该方法在公开的数据集CelebA-HQ实验并在定量和定性指标上都高于基线方法,在不同的人脸朝向时也表现出良好的鲁棒性.模型生成的图像还能应用于人脸图像生成领域,解决数据集匮乏等问题.To tackle the problem that the existing face image translation models cannot realize the translation among multiple visual attributes and the translated face images are not clear and natural,this study proposes a multi-attribute face image translation model based on the face recognition method.The model is mainly composed of the content and style encoder,AdaIN decoder,and face recognition module.First,the two encoders extract the potential encoding of the content and style image and then send the encoding into the AdaIN layer for affine transformation,and finally the decoder restores the translated image.A face recognition model is designed and trained using this method with an accuracy rate of 90.282%.A joint face attribute loss function is proposed,which enhances the model’s attention to the attributes of the style face,solves the problem that the model cannot accurately extract the attribute information of the face,and discards irrelevant information so that the model can generate clear,multi-attribute,and diverse face translation images.This method is tested on the open dataset CelebA-HQ,whose results are higher than the baselines in terms of both quantitative and qualitative indicators.It also shows good robustness in different face orientations.The image generated by the model can also be used in the field of face image generation to address dataset shortage.

关 键 词:人脸图像翻译 人脸识别 图像生成 人脸属性 无监督学习 风格翻译 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象