Authors: NIE Xiongfeng (聂雄锋), WANG Junying (王俊英) [1,2,3], DONG Fangmin (董方敏) [3], ZANG Zhaoxiang (臧兆祥) [1,3], JIANG Shu (江曙)
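Affiliations: [1] College of Computer and Information Technology, China Three Gorges University, Yichang, Hubei 443002, China; [2] Hubei Construction Quality Inspection Equipment Engineering Technology Research Center, China Three Gorges University, Yichang, Hubei 443002, China; [3] Hubei Key Laboratory of Intelligent Vision Based Monitoring for Hydroelectric Engineering, China Three Gorges University, Yichang, Hubei 443002, China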
Source: Computer Engineering and Applications (《计算机工程与应用》), 2023, No. 15, pp. 223-234 (12 pages)
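Funding: Key Project of the Xinjiang Joint Fund, National Natural Science Foundation of China (U1703261); Hubei Province Open Fund for Intelligent Vision Based Monitoring for Hydroelectric Engineering (2017SDSJ04).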
Abstract: Some current methods do not match the style to the content structure of the image. When they apply anime style transfer to images with complex semantic information and salient features, the generated images show insufficiently rich style colors, artifacts, and loss of some content details. This paper proposes MastGAN-CBAM, a multimodal anime style transfer method that incorporates an attention mechanism. The method clusters the anime image features into several sub-feature components, uses the GraphCut algorithm to match these components with the local content image features, and then computes the style loss of these features with the Gram matrix, thereby constructing a multimodal style loss function. Because this style loss adapts to the multimodal features of the image, it optimizes and adjusts the network parameters more effectively. The method also introduces a mixed-domain attention mechanism, which improves the model's efficiency and accuracy and further improves the anime style transfer results. Experimental results show that the generated images have more complete details, a more pronounced anime style, and fewer artifacts, and that the anime stylization effect improves to some extent. In experiments on three anime datasets, including Spirited Away (《千与千寻》), the FID scores reach 164.89, 162.02, and 199.37 respectively, and the method also achieves good results in video anime style transfer.
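The multimodal style loss described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names are hypothetical, features are plain lists of per-position channel vectors rather than network activations, a small deterministic k-means stands in for the clustering step, and a nearest-centroid assignment is swapped in for the GraphCut matching. It only shows the general idea of computing one Gram-matrix loss per style cluster instead of a single global one.

```python
def gram_matrix(features):
    """Return the C x C Gram matrix of N feature vectors, normalized by N."""
    n, c = len(features), len(features[0])
    gram = [[0.0] * c for _ in range(c)]
    for vec in features:
        for i in range(c):
            for j in range(c):
                gram[i][j] += vec[i] * vec[j]
    return [[g / n for g in row] for row in gram]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vec, centroids):
    """Index of the centroid closest to vec."""
    return min(range(len(centroids)),
               key=lambda i: squared_distance(vec, centroids[i]))


def kmeans(vectors, k, iterations=10):
    """Tiny k-means; the first k vectors are the initial centroids (assumption)."""
    centroids = [list(v) for v in vectors[:k]]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in vectors:
            groups[nearest(v, centroids)].append(v)
        # Keep the old centroid when a cluster ends up empty.
        centroids = [
            [sum(col) / len(g) for col in zip(*g)] if g else centroids[i]
            for i, g in enumerate(groups)
        ]
    return centroids


def multimodal_style_loss(output_feats, content_feats, style_feats, k=2):
    """Sum of per-cluster squared Gram differences (illustrative only)."""
    centroids = kmeans(style_feats, k)
    style_groups = [[] for _ in range(k)]
    for v in style_feats:
        style_groups[nearest(v, centroids)].append(v)
    # Stand-in for GraphCut: each position joins the style cluster whose
    # centroid is nearest to its content feature.
    out_groups = [[] for _ in range(k)]
    for content_vec, out_vec in zip(content_feats, output_feats):
        out_groups[nearest(content_vec, centroids)].append(out_vec)
    loss = 0.0
    for s_group, o_group in zip(style_groups, out_groups):
        if not s_group or not o_group:
            continue  # nothing to compare for this cluster
        g_s, g_o = gram_matrix(s_group), gram_matrix(o_group)
        loss += sum((a - b) ** 2
                    for row_s, row_o in zip(g_s, g_o)
                    for a, b in zip(row_s, row_o))
    return loss
```

In this sketch, an output whose features already match the style features of its assigned cluster yields a loss of zero, while an output carrying the statistics of the wrong cluster is penalized. A practical version would operate on convolutional feature maps with a tensor library and a graph-cut solver.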
Keywords: deep learning; anime style transfer; generative adversarial network; multimodal matching; attention mechanism
Classification: TP391.41 [Automation and Computer Technology: Computer Application Technology]