Multimodal Animation Style Transfer Method Fused with Attention Mechanism

Cited by: 2

Authors: NIE Xiongfeng; WANG Junying [1,2,3]; DONG Fangmin [3]; ZANG Zhaoxiang [1,3]; JIANG Shu

Affiliations: [1] College of Computer and Information Technology, China Three Gorges University, Yichang, Hubei 443002, China; [2] Hubei Construction Quality Inspection Equipment Engineering Technology Research Center, China Three Gorges University, Yichang, Hubei 443002, China; [3] Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang, Hubei 443002, China

Source: Computer Engineering and Applications, 2023, No. 15, pp. 223-234 (12 pages)

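Funding: Key Project of the Xinjiang Joint Fund of the National Natural Science Foundation of China (U1703261); Open Fund for Intelligent Vision Based Monitoring for Hydroelectric Engineering of Hubei Province (2017SDSJ04).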

Abstract: Some current methods do not match the style to the content structure of the image. When they are applied to animation style transfer of images with complex semantic information and salient features, the generated images show insufficiently rich style colors, artifacts, and loss of some content details. This paper proposes MastGAN-CBAM, a multimodal animation style transfer method fused with an attention mechanism. The method clusters the animation image features into several sub-feature components, uses the GraphCut algorithm to match these components with the features of local content regions, and then uses the Gram matrix to compute the style loss of the matched features, which yields a multimodal style loss function. Because this style loss adapts to the multimodal features of the image, the network parameters can be optimized and adjusted more effectively. The method also introduces a hybrid-domain attention mechanism, which improves the efficiency and accuracy of the model and further improves the animation style transfer effect. Experimental results show that the images generated by this method have more complete details, a more pronounced animation style, and fewer artifacts, and that the overall animation effect is improved to a certain extent. In experiments on three animation datasets, including "Spirited Away", the FID scores reach 164.89, 162.02, and 199.37 respectively, and good results are also obtained for animation style transfer of video.
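The multimodal style loss described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`gram`, `kmeans`, `assign`, `multimodal_style_loss`), the use of plain k-means for clustering, the number of components `k`, and the flat `(positions, channels)` feature layout are all assumptions made for illustration. In particular, a simple nearest-centroid assignment stands in for the GraphCut matching step, which the abstract names but does not specify.

```python
import numpy as np


def gram(feats):
    """Gram matrix of a (n_positions, channels) array, normalised by n_positions."""
    return feats.T @ feats / feats.shape[0]


def assign(x, centroids):
    """Index of the nearest centroid (squared Euclidean distance) for each row of x."""
    dists = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=-1)
    return dists.argmin(axis=1)


def kmeans(x, k, iters=20, seed=0):
    """Plain k-means (assumed clustering method); returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)].astype(float)
    for _ in range(iters):
        labels = assign(x, centroids)
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = x[labels == j].mean(axis=0)
    return centroids, assign(x, centroids)


def multimodal_style_loss(content_feats, style_feats, k=3):
    """Sum of per-component Gram losses between matched content and style features."""
    content_feats = np.asarray(content_feats, dtype=float)
    style_feats = np.asarray(style_feats, dtype=float)
    # Cluster the style (animation) features into k sub-feature components.
    centroids, style_labels = kmeans(style_feats, k)
    # ASSUMPTION: nearest-centroid assignment replaces the GraphCut matching.
    content_labels = assign(content_feats, centroids)
    loss = 0.0
    for j in range(k):
        c = content_feats[content_labels == j]
        s = style_feats[style_labels == j]
        if len(c) == 0 or len(s) == 0:
            continue  # no matched pair for this component
        loss += np.mean((gram(c) - gram(s)) ** 2)
    return float(loss)


rng = np.random.default_rng(42)
style = rng.normal(size=(200, 4))             # hypothetical style features
content = 3.0 * rng.normal(size=(150, 4)) + 1.0  # hypothetical content features
loss_value = multimodal_style_loss(content, style, k=3)
```

Computing one Gram loss per matched component, rather than a single Gram matrix over the whole image, is what lets the loss follow several style modes at once; in a real training setup the features would come from a pretrained network and the loss would be written in a differentiable framework.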

Keywords: deep learning; animation style transfer; generative adversarial network; multimodal matching; attention mechanism

Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]
