基于半监督学习的单视角点云三维人体姿态与形状估计

3D human pose and shape estimation from single-view point clouds with semi-supervised learning

作　　者：方程浩王康侃 FANG Chenghao;WANG Kangkan(Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education,Nanjing University of Science and Technology,Nanjing Jiangsu 210094,China)

机构地区：[1]南京理工大学高维信息智能感知与系统教育部重点实验室,江苏南京210094

出　　处：《图学学报》2025年第2期393-401,共9页Journal of Graphics

基　　金：国家自然科学基金(62472224);中央高校基础研究基金(NJ2023032);浙江大学计算机辅助设计与图形系统全国重点实验室开放课题(A2311);南京大学计算机软件新技术全国重点实验室开放课题(KFKT2024B37)。

摘　　要：在有限标签样本的条件下,单视角点云的三维人体姿态和形状估计一直存在模型估计精度低、泛化能力弱等问题。现有的方法通常采用微调方法优化模型,但对新样本的微调步骤大大增加了运行复杂度,本质上没有提高模型的泛化能力。为解决以上问题,提出了一种基于半监督学习的三维人体姿态与形状估计方法,在有限的标签数据条件下,利用大量无标签人体点云数据提高模型估计精度和泛化能力。具体地,首先对无标签数据进行弱增强和强增强,同时估计2种增强样本的三维人体参数模型。然后对弱增强样本的预测结果进行伪标签准确性判断,并基于一致性正则化思想约束强增强样本的预测结果,以迭代方式逐步优化伪标签质量和增加用于训练的伪标签数量,进而提升模型的估计精度。该算法在多种公开数据集上做了充分的定量和定性实验,实验结果证明该算法在有限标签样本的条件下提高了三维人体姿态和形状的估计精度,并增强了模型的泛化性能。Under the condition of limited labeled samples,estimating 3D human pose and shape from single-view point clouds has consistently encountered issues such as low model estimation accuracy and weak generalization capability.Existing methods typically use a fine-tuning step to optimize the models for limited labeled samples,but this fine-tuning process significantly increases computational complexity and without fundamentally enhancing model generalization.To address these issues,a semi-supervised learning-based method was proposed for 3D human pose and shape estimation.Under conditions of limited labeled data,the proposed method utilized a large amount of unlabeled human point clouds to improve model accuracy and generalization capability.Specifically,weak and strong augmentations were applied to the unlabeled data,and 3D human parameter models were estimated for both types of augmented samples.Then,the accuracy of pseudo-labels for weakly-augmented samples was evaluated,and the predictions of strongly augmented samples were constrained based on consistency regularization.The procedure above was applied iteratively to gradually refine the quality of pseudo-labels and increase the number of pseudo-labels for training,thereby enhancing the model’s estimation accuracy.Extensive quantitative and qualitative experiments on various public datasets demonstrate that the proposed method enhanced the accuracy of 3D human pose and shape estimation under conditions of limited labeled samples and enhanced model generalization performance.

关键词：三维人体姿态与形状估计单视角点云半监督学习伪标签点云数据增强

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于半监督学习的单视角点云三维人体姿态与形状估计

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于半监督学习的单视角点云三维人体姿态与形状估计

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索