基于Transformer-CVAE的三维人体动画生成方法  被引量:1

3D Human Animation Synthesis with Transformer-CVAE

在线阅读下载全文

作  者:冯文科 石敏[1] 朱登明[2] 李兆歆 FENG Wenke;SHI Min;ZHU Dengming;LI Zhaoxin(School of Control and Computer Engineering,North China Electric Power University,Beijing 102206,China;Prospective Research Laboratory,Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China)

机构地区:[1]华北电力大学控制与计算机工程学院,北京102206 [2]中国科学院计算技术研究所前瞻研究实验室,北京100190

出  处:《计算机科学与探索》2023年第9期2137-2147,共11页Journal of Frontiers of Computer Science and Technology

基  金:国家自然科学基金(61972379)。

摘  要:三维人体动画生成技术是三维动画领域的核心技术。基于动作捕捉的人体动画生成方法通常制作流程较为繁琐、制作周期较长,无法快速生成人体动画,而现有数据驱动的方法生成的人体动画缺乏真实性,且生成人体运动的种类相对有限。基于此,提出了一种基于Transformer-CVAE的三维人体动画生成方法。首先,基于真实的人体运动构建人体运动数据集,并按照运动种类进行类别划分;其次,基于Transformer网络架构学习运动序列的时序依赖关系,进一步引入变分自编码器结构学习运动序列在隐空间上的概率分布;然后,在隐空间施加约束条件进而控制生成人体运动的效果;最后,在AMASS、HumanACT12、UESTC等数据集上进行实验,并从视觉效果与网络性能两方面对方法进行分析。实验结果表明,与现有方法相比,所提方法可生成种类丰富、真实细腻的人体动画,且在STED、RMSE等指标上具有明显的提升。3D human animation synthesis is a dominant technology in the domain of 3D animation.Traditional workflows depending on motion capture cannot generate human animation quickly due to complicated procedure and long authoring period.Existing data-driven methods have limited learning capability and therefore the generated animations are lack of realism and the categories of the generation are relatively limited.To that end,this paper presents a 3D human animation synthesis method based on a Transformer-based conditional variation autoencoder(Transformer-CVAE).Firstly,the motion dataset is constructed and classified by the motion category.Then,the temporal relationship between different frames in a common sequence is established by means of the Transformer architecture,and a variational autoencoder is further combined with the Transformer to infer the probabilistic distribution of human motions.Next,to control the desired body motion generated,the constraints are imposed on the latent space.Finally,a series of experiments are conducted on AMASS,HumanACT12 and UESTC datasets and the qualitative and quantitative evaluation is made from two aspects:the visual effect and the performance.Experimental results demonstrate that the method achieves superior performance in the metrics like STED,RMSE,etc.compared with the state-of-art,while capable of synthesizing various human animations with realism.

关 键 词:TRANSFORMER 条件变分自编码器 三维人体动画 计算机图形学 

分 类 号:TP391.41[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象