Shichao Li, Lei Ke, Kevin Pratama, Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng
End-to-end deep representation learning has achieved remarkable accuracy for monocular 3D human pose estimation, yet these models may fail for unseen poses with limited and fixed training data. This paper proposes a novel data augmentation method that: (1) is scalable for synthesizing massive amount of training data (over 8 million valid 3D human poses with corresponding 2D projections) for training 2D-to-3D networks, (2) can effectively reduce dataset bias. Our method evolves a limited dataset to synthesize unseen 3D human skeletons based on a hierarchical human representation and heuristics inspired by prior knowledge. Extensive experiments show that our approach not only achieves state-of-the-art accuracy on the largest public benchmark, but also generalizes significantly better to unseen and rare poses. Code, pre-trained models and tools are available at this HTTPS URL.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 3D Human Pose Estimation | MPI-INF-3DHP | AUC | 46.1 | EvoSkeleton |
| 3D Human Pose Estimation | MPI-INF-3DHP | MPJPE | 99.7 | EvoSkeleton |
| 3D Human Pose Estimation | MPI-INF-3DHP | PCK | 81.2 | EvoSkeleton |
| 3D Human Pose Estimation | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 3D Human Pose Estimation | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 3D Human Pose Estimation | Human3.6M | Frames Needed | 1 | TAG-Net |
| 3D Human Pose Estimation | Human3.6M | Average MPJPE (mm) | 62.9 | Li et al. |
| Pose Estimation | MPI-INF-3DHP | AUC | 46.1 | EvoSkeleton |
| Pose Estimation | MPI-INF-3DHP | MPJPE | 99.7 | EvoSkeleton |
| Pose Estimation | MPI-INF-3DHP | PCK | 81.2 | EvoSkeleton |
| Pose Estimation | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| Pose Estimation | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| Pose Estimation | Human3.6M | Frames Needed | 1 | TAG-Net |
| Pose Estimation | Human3.6M | Average MPJPE (mm) | 62.9 | Li et al. |
| 3D | MPI-INF-3DHP | AUC | 46.1 | EvoSkeleton |
| 3D | MPI-INF-3DHP | MPJPE | 99.7 | EvoSkeleton |
| 3D | MPI-INF-3DHP | PCK | 81.2 | EvoSkeleton |
| 3D | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 3D | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 3D | Human3.6M | Frames Needed | 1 | TAG-Net |
| 3D | Human3.6M | Average MPJPE (mm) | 62.9 | Li et al. |
| 1 Image, 2*2 Stitchi | MPI-INF-3DHP | AUC | 46.1 | EvoSkeleton |
| 1 Image, 2*2 Stitchi | MPI-INF-3DHP | MPJPE | 99.7 | EvoSkeleton |
| 1 Image, 2*2 Stitchi | MPI-INF-3DHP | PCK | 81.2 | EvoSkeleton |
| 1 Image, 2*2 Stitchi | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 1 Image, 2*2 Stitchi | Human3.6M | Average MPJPE (mm) | 50.9 | TAG-Net |
| 1 Image, 2*2 Stitchi | Human3.6M | Frames Needed | 1 | TAG-Net |
| 1 Image, 2*2 Stitchi | Human3.6M | Average MPJPE (mm) | 62.9 | Li et al. |