检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:杨硕[1] 王一丁[1] YANG Shuo;WANG Yiding(School of Information,North China University of Technology,Beijing 100144,China)
出 处:《计算机工程》2024年第6期255-265,共11页Computer Engineering
基 金:国家自然科学基金(62276018)。
摘 要:面部动画在电影、游戏、虚拟现实等领域起着关键作用,对于实现逼真、生动的人脸动画和情感传达至关重要。当面临面部形状、姿态、表情等多个变化因素时,虽然通过薄板样条非线性变换可以获得较好的运动估计结果,但在处理面部复杂纹理和嘴部运动时存在运动估计不精细的问题,需要更强大的图像修复能力。因此,提出一种基于改进薄板样条运动模型(TPSMM)的人脸动画算法。首先,在TPSMM的基础上引入一种Farneback光流金字塔算法,通过与薄板样条变换和背景仿射变换相结合,使得人脸局部运动估计更精准;其次,为了更真实地恢复缺失区域的细节纹理信息,提出一种多尺度细节感知网络,该网络在编码器中通过嵌入通道注意力(ECA)模块减少源图像因多层下采样而导致的人脸细节信息丢失,在解码器中利用坐标注意力(CA)模块来有效捕获运动估计特征图中不同位置的重要特征,提高人脸图像的生成质量。实验结果表明,相比一阶段运动模型(FOMM)、关节动画的运动表示法(MRAA)、TPSMM等,该算法在MUG、UvA-Nemo和Oulu-CASIA数据集上的L1、平均关键点距离(AKD)、平均欧氏距离(AED)数值均达到最优,平均分别为0.0129、0.923、0.00099。Facial animation plays a crucial role in applications involving movies,games,and virtual reality in terms of achieving realistic and vivid emotional communication.When handling multiple factors,such as facial shape,posture,and expression,good motion estimation results can be obtained through thin plate spline nonlinear transformation.However,this approach results in imprecise motion estimation when dealing with complex facial textures and mouth movements,necessitating better image restoration capabilities.To address this issue,this paper proposes a facial animation algorithm based on an improved Thin Plate Spline Motion Model(TPSMM).First,based on TPSMM,a Farneback optical flow pyramid algorithm is introduced,which combines the thin plate spline and background affine transformations to enhance the accuracy of local facial motion estimation.Second,to accurately recover the detailed textural information for missing areas,a multi-scale detail perception network is introduced.This network minimizes the loss of facial detail information caused by multi-layer downsampling of the source image by Embedding Channel Attention(ECA)modules in the encoder.In the decoder,the Coordinate Attention(CA)module effectively captures important features at different positions in the motion estimation feature map,thereby improving the quality of facial image generation.Experimental results show that,compared to the First Order Motion Model(FOMM),Motion Representations for Articulated Animation(MRAA),and TPSMM,the proposed algorithm achieves optimal L1,Average Keypoint Distance(AKD),and Average Euclidean Distance(AED)values on the MUG,UvA-Nemo,and Oulu-CASIA datasets,with averages of 0.0129,0.923,and 0.00099,respectively.
关 键 词:面部动画 光流估计 薄板样条 多尺度特征融合 通道注意力机制 坐标注意力机制
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.191.201.27