Akash Sengupta, Ignas Budvytis, Roberto Cipolla
This paper addresses the problem of 3D human body shape and pose estimation from an RGB image. This is often an ill-posed problem, since multiple plausible 3D bodies may match the visual evidence present in the input - particularly when the subject is occluded. Thus, it is desirable to estimate a distribution over 3D body shape and pose conditioned on the input image instead of a single 3D reconstruction. We train a deep neural network to estimate a hierarchical matrix-Fisher distribution over relative 3D joint rotation matrices (i.e. body pose), which exploits the human body's kinematic tree structure, as well as a Gaussian distribution over SMPL body shape parameters. To further ensure that the predicted shape and pose distributions match the visual evidence in the input image, we implement a differentiable rejection sampler to impose a reprojection loss between ground-truth 2D joint coordinates and samples from the predicted distributions, projected onto the image plane. We show that our method is competitive with the state-of-the-art in terms of 3D shape and pose metrics on the SSP-3D and 3DPW datasets, while also yielding a structured probability distribution over 3D body shape and pose, with which we can meaningfully quantify prediction uncertainty and sample multiple plausible 3D reconstructions to explain a given input image. Code is available at https://github.com/akashsengupta1997/HierarchicalProbabilistic3DHuman .
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 3D Human Pose Estimation | 3DPW | MPJPE | 84.9 | Hierarchical Probabilistic Humans |
| 3D Human Pose Estimation | 3DPW | PA-MPJPE | 53.6 | Hierarchical Probabilistic Humans |
| 3D Human Pose Estimation | SSP-3D | PVE-T-SC | 13.6 | Hierarchical Probabilistic Humans |
| Pose Estimation | 3DPW | MPJPE | 84.9 | Hierarchical Probabilistic Humans |
| Pose Estimation | 3DPW | PA-MPJPE | 53.6 | Hierarchical Probabilistic Humans |
| Pose Estimation | SSP-3D | PVE-T-SC | 13.6 | Hierarchical Probabilistic Humans |
| 3D | 3DPW | MPJPE | 84.9 | Hierarchical Probabilistic Humans |
| 3D | 3DPW | PA-MPJPE | 53.6 | Hierarchical Probabilistic Humans |
| 3D | SSP-3D | PVE-T-SC | 13.6 | Hierarchical Probabilistic Humans |
| 3D Absolute Human Pose Estimation | SSP-3D | PVE-T-SC | 13.6 | Hierarchical Probabilistic Humans |
| 1 Image, 2*2 Stitchi | 3DPW | MPJPE | 84.9 | Hierarchical Probabilistic Humans |
| 1 Image, 2*2 Stitchi | 3DPW | PA-MPJPE | 53.6 | Hierarchical Probabilistic Humans |
| 1 Image, 2*2 Stitchi | SSP-3D | PVE-T-SC | 13.6 | Hierarchical Probabilistic Humans |