A survey of deep-learning-based Chinese character generation and font style transfer  (Cited by: 8)

Review of Chinese characters generation and font transfer based on deep learning


Authors: Wang Chen[1]; Wu Guohua[1]; Yao Ye[1]; Ren Yizhi[1]; Wang Qiuhua[1]; Yuan Lifeng (School of Cyberspace, Hangzhou Dianzi University, Hangzhou 310018, China)

Affiliation: [1] School of Cyberspace, Hangzhou Dianzi University, Hangzhou 310018, China

Source: Journal of Image and Graphics (《中国图象图形学报》), 2022, No. 12, pp. 3415-3428 (14 pages)

Funding: National Natural Science Foundation of China (62071267).

Abstract: Chinese character font style transfer aims to convert the glyph shape of a character while keeping its semantic content unchanged. Because deep learning performs well on image style transfer tasks, Chinese character generation can start from character images and use this technique to convert between fonts, reducing manual intervention in font design and easing the design workload. However, how to improve the quality of the generated images remains an open problem. This paper first systematically reviews existing work on Chinese character font style transfer and divides it into three categories: methods based on convolutional neural networks (CNN), auto-encoders (AE), and generative adversarial networks (GAN). It then compares 22 font style transfer methods in terms of the dataset size they require and their applicability to conversions between different font categories, and summarizes their characteristics, including refining character image features, relying on pre-trained models to extract effective features, and supporting de-stylization. In addition, an image dataset of simplified and traditional Chinese characters in multiple fonts is built according to the Chinese radical index table, and representative methods are selected for comparative experiments that convert a source font (FangSong) into target fonts (printed and handwritten). The generated results of four representative methods, Rewrite2, zi2zi, TET-GAN (texture effects transfer GAN), and Unet-GAN, are presented and analyzed. Finally, the current state and challenges of the field are summarized and future directions are discussed. Because Chinese characters are large in number and diverse in style, deep-learning-based Chinese character generation and font style transfer is not yet mature. Future work is expected to explore integrating stylization and de-stylization of characters into a single framework and extracting character features effectively, so that font design becomes more flexible and personalized.

Deep learning has recently proven capable of image style transfer. Chinese character font transfer focuses on preserving content while converting the font attribute. Thanks to deep learning, the workload of font design for Chinese characters can be reduced effectively and the limits imposed by manual intervention can be avoided. However, the quality of the generated images remains a challenge. This review analyzes the most representative image generation and font transfer methods for Chinese characters. The literature on contemporary font transfer methods for Chinese characters is systematically summarized and divided into three categories: 1) convolutional neural network based (CNN-based), 2) auto-encoder based (AE-based), and 3) generative adversarial network based (GAN-based). To avoid losing information during data reconstruction, a convolutional neural network extracts image features without changing the dimensions of the data. An auto-encoder processes the data through a deep neural network to learn the distribution of real samples and generate realistic fake samples. Generative adversarial networks became popular in Chinese character font transfer after being proposed by Goodfellow. A GAN generally consists of a generator and a discriminator. Its core idea comes from the Nash equilibrium of game theory, which is reflected in the continuous optimization between the generator and the discriminator. The generator learns the distribution of real data, generates fake images, and tries to induce the discriminator to make wrong decisions. The discriminator tries to determine whether its input is real or fake. Through this game, the discriminator eventually cannot distinguish real images from fake ones. According to how they learn the font style features of Chinese characters, the GAN-based methods are divided into three categories: 1) self-learning font style features, 2
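
The generator/discriminator game described in the abstract is commonly written as a minimax objective. The form below is a sketch of the standard objective from the original GAN formulation, not a formula taken from this record. The symbols $G$, $D$, $p_{\mathrm{data}}$, $p_z$, and $V$ are notation assumed here for illustration.

```latex
% Standard GAN minimax objective (assumed notation, not from the abstract):
%   G: generator, D: discriminator,
%   p_data: distribution of real images, p_z: prior over noise inputs.
\min_{G}\,\max_{D}\; V(D, G)
  = \mathbb{E}_{x \sim p_{\mathrm{data}}}\bigl[\log D(x)\bigr]
  + \mathbb{E}_{z \sim p_{z}}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]
```

At the equilibrium of this game the discriminator outputs $D(x) = 1/2$ for every input, which corresponds to the statement that it can no longer distinguish real images from generated ones.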

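Keywords: Chinese character font style transfer; image generation; convolutional neural network (CNN); auto-encoder (AE); generative adversarial network (GAN)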

CLC number: TP391.1 [Automation and Computer Technology: Computer Application Technology]

 
