Shanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
This paper considers a new problem: adapting a pre-trained model of human mesh reconstruction to out-of-domain streaming videos. Most previous methods based on the parametric SMPL model \cite{loper2015smpl} underperform in new domains with unexpected, domain-specific attributes, such as camera parameters, bone lengths, backgrounds, and occlusions. Our general idea is to dynamically fine-tune the source model on test video streams with additional temporal constraints, so that it mitigates the domain gaps without over-fitting to the 2D information of individual test frames. A subsequent challenge is how to avoid conflicts between the 2D and temporal constraints. We tackle this problem with a new training algorithm named Bilevel Online Adaptation (BOA), which divides the overall multi-objective optimization into two steps, weight probe and weight update, within each training iteration. We demonstrate that BOA leads to state-of-the-art results on two human mesh reconstruction benchmarks.
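The two-step structure of a BOA iteration can be illustrated with a toy sketch. This is not the paper's implementation: the quadratic `loss_2d` and `loss_temp` functions below are hypothetical stand-ins for the per-frame 2D reprojection loss and the temporal-consistency loss, and the update uses a first-order approximation (the temporal gradient is evaluated at the probed weights but applied to the original weights).

```python
import numpy as np

# Hypothetical stand-ins for the paper's constraints:
# loss_2d   - per-frame 2D loss (fits the current frame)
# loss_temp - temporal-consistency loss across adjacent frames
def loss_2d(w, frame):
    return float(np.sum((w - frame) ** 2))

def grad_2d(w, frame):
    return 2.0 * (w - frame)

def loss_temp(w, prev_frame, frame):
    # Encourage predictions to stay close across adjacent frames.
    return float(np.sum((w - 0.5 * (prev_frame + frame)) ** 2))

def grad_temp(w, prev_frame, frame):
    return 2.0 * (w - 0.5 * (prev_frame + frame))

def boa_step(w, prev_frame, frame, lr_probe=0.1, lr_update=0.1):
    # Step 1 (weight probe): a trial gradient step on the 2D loss only.
    w_probe = w - lr_probe * grad_2d(w, frame)
    # Step 2 (weight update): update the ORIGINAL weights using the
    # temporal loss evaluated at the probed weights, so the two
    # objectives are reconciled rather than naively summed.
    return w - lr_update * grad_temp(w_probe, prev_frame, frame)
```

Running `boa_step` sequentially over a drifting stream of frames moves the weights toward the stream without ever optimizing a joint 2D-plus-temporal objective directly, which is the intuition behind separating probe and update.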
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 3D Human Pose Estimation | 3DPW | MPJPE | 77.2 | BOA (w/ 2D GT) |
| 3D Human Pose Estimation | 3DPW | MPVPE | 91.2 | BOA (w/ 2D GT) |
| 3D Human Pose Estimation | 3DPW | PA-MPJPE | 49.5 | BOA (w/ 2D GT) |