Muhammed Kocabas, Chun-Hao P. Huang, Otmar Hilliges, Michael J. Black
Despite significant progress, we show that state of the art 3D human pose and shape estimation methods remain sensitive to partial occlusion and can produce dramatically wrong predictions although much of the body is observable. To address this, we introduce a soft attention mechanism, called the Part Attention REgressor (PARE), that learns to predict body-part-guided attention masks. We observe that state-of-the-art methods rely on global feature representations, making them sensitive to even small occlusions. In contrast, PARE's part-guided attention mechanism overcomes these issues by exploiting information about the visibility of individual body parts while leveraging information from neighboring body-parts to predict occluded parts. We show qualitatively that PARE learns sensible attention masks, and quantitative evaluation confirms that PARE achieves more accurate and robust reconstruction results than existing approaches on both occlusion-specific and standard benchmarks. The code and data are available for research purposes at {\small \url{https://pare.is.tue.mpg.de/}}
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 3D Human Pose Estimation | AGORA | B-MPJPE | 146.2 | PARE |
| 3D Human Pose Estimation | AGORA | B-MVE | 140.9 | PARE |
| 3D Human Pose Estimation | AGORA | B-NMJE | 174 | PARE |
| 3D Human Pose Estimation | AGORA | B-NMVE | 167.7 | PARE |
| 3D Human Pose Estimation | EMDB | Average MPJAE (deg) | 24.673 | PARE |
| 3D Human Pose Estimation | EMDB | Average MPJAE-PA (deg) | 22.3842 | PARE |
| 3D Human Pose Estimation | EMDB | Average MPJPE (mm) | 113.887 | PARE |
| 3D Human Pose Estimation | EMDB | Average MPJPE-PA (mm) | 72.203 | PARE |
| 3D Human Pose Estimation | EMDB | Average MVE (mm) | 133.247 | PARE |
| 3D Human Pose Estimation | EMDB | Average MVE-PA (mm) | 85.3788 | PARE |
| 3D Human Pose Estimation | EMDB | Jitter (10m/s^3) | 75.1137 | PARE |
| 3D Human Pose Estimation | AGORA | B-MPJPE | 146.2 | PARE |
| 3D Human Pose Estimation | AGORA | B-MVE | 140.9 | PARE |
| 3D Human Pose Estimation | AGORA | B-NMJE | 174 | PARE |
| 3D Human Pose Estimation | AGORA | B-NMVE | 167.7 | PARE |
| Pose Estimation | AGORA | B-MPJPE | 146.2 | PARE |
| Pose Estimation | AGORA | B-MVE | 140.9 | PARE |
| Pose Estimation | AGORA | B-NMJE | 174 | PARE |
| Pose Estimation | AGORA | B-NMVE | 167.7 | PARE |
| Pose Estimation | EMDB | Average MPJAE (deg) | 24.673 | PARE |
| Pose Estimation | EMDB | Average MPJAE-PA (deg) | 22.3842 | PARE |
| Pose Estimation | EMDB | Average MPJPE (mm) | 113.887 | PARE |
| Pose Estimation | EMDB | Average MPJPE-PA (mm) | 72.203 | PARE |
| Pose Estimation | EMDB | Average MVE (mm) | 133.247 | PARE |
| Pose Estimation | EMDB | Average MVE-PA (mm) | 85.3788 | PARE |
| Pose Estimation | EMDB | Jitter (10m/s^3) | 75.1137 | PARE |
| Pose Estimation | AGORA | B-MPJPE | 146.2 | PARE |
| Pose Estimation | AGORA | B-MVE | 140.9 | PARE |
| Pose Estimation | AGORA | B-NMJE | 174 | PARE |
| Pose Estimation | AGORA | B-NMVE | 167.7 | PARE |
| 3D | AGORA | B-MPJPE | 146.2 | PARE |
| 3D | AGORA | B-MVE | 140.9 | PARE |
| 3D | AGORA | B-NMJE | 174 | PARE |
| 3D | AGORA | B-NMVE | 167.7 | PARE |
| 3D | EMDB | Average MPJAE (deg) | 24.673 | PARE |
| 3D | EMDB | Average MPJAE-PA (deg) | 22.3842 | PARE |
| 3D | EMDB | Average MPJPE (mm) | 113.887 | PARE |
| 3D | EMDB | Average MPJPE-PA (mm) | 72.203 | PARE |
| 3D | EMDB | Average MVE (mm) | 133.247 | PARE |
| 3D | EMDB | Average MVE-PA (mm) | 85.3788 | PARE |
| 3D | EMDB | Jitter (10m/s^3) | 75.1137 | PARE |
| 3D | AGORA | B-MPJPE | 146.2 | PARE |
| 3D | AGORA | B-MVE | 140.9 | PARE |
| 3D | AGORA | B-NMJE | 174 | PARE |
| 3D | AGORA | B-NMVE | 167.7 | PARE |
| 3D Multi-Person Pose Estimation | AGORA | B-MPJPE | 146.2 | PARE |
| 3D Multi-Person Pose Estimation | AGORA | B-MVE | 140.9 | PARE |
| 3D Multi-Person Pose Estimation | AGORA | B-NMJE | 174 | PARE |
| 3D Multi-Person Pose Estimation | AGORA | B-NMVE | 167.7 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-MPJPE | 146.2 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-MVE | 140.9 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-NMJE | 174 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-NMVE | 167.7 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MPJAE (deg) | 24.673 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MPJAE-PA (deg) | 22.3842 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MPJPE (mm) | 113.887 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MPJPE-PA (mm) | 72.203 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MVE (mm) | 133.247 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Average MVE-PA (mm) | 85.3788 | PARE |
| 1 Image, 2*2 Stitchi | EMDB | Jitter (10m/s^3) | 75.1137 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-MPJPE | 146.2 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-MVE | 140.9 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-NMJE | 174 | PARE |
| 1 Image, 2*2 Stitchi | AGORA | B-NMVE | 167.7 | PARE |