TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/POTTER: Pooling Attention Transformer for Efficient Human ...

POTTER: Pooling Attention Transformer for Efficient Human Mesh Recovery

Ce Zheng, Xianpeng Liu, Guo-Jun Qi, Chen Chen

2023-03-23CVPR 2023 13D Human Pose EstimationHuman Mesh Recovery
PaperPDFCode

Abstract

Transformer architectures have achieved SOTA performance on the human mesh recovery (HMR) from monocular images. However, the performance gain has come at the cost of substantial memory and computational overhead. A lightweight and efficient model to reconstruct accurate human mesh is needed for real-world applications. In this paper, we propose a pure transformer architecture named POoling aTtention TransformER (POTTER) for the HMR task from single images. Observing that the conventional attention module is memory and computationally expensive, we propose an efficient pooling attention module, which significantly reduces the memory and computational cost without sacrificing performance. Furthermore, we design a new transformer architecture by integrating a High-Resolution (HR) stream for the HMR task. The high-resolution local and global features from the HR stream can be utilized for recovering more accurate human mesh. Our POTTER outperforms the SOTA method METRO by only requiring 7% of total parameters and 14% of the Multiply-Accumulate Operations on the Human3.6M (PA-MPJPE metric) and 3DPW (all three metrics) datasets. The project webpage is https://zczcwh.github.io/potter_page.

Results

TaskDatasetMetricValueModel
3D Human Pose Estimation3DPWMPJPE75POTTER
3D Human Pose Estimation3DPWMPVPE87.4POTTER
3D Human Pose Estimation3DPWPA-MPJPE44.8POTTER
Pose Estimation3DPWMPJPE75POTTER
Pose Estimation3DPWMPVPE87.4POTTER
Pose Estimation3DPWPA-MPJPE44.8POTTER
3D3DPWMPJPE75POTTER
3D3DPWMPVPE87.4POTTER
3D3DPWPA-MPJPE44.8POTTER
1 Image, 2*2 Stitchi3DPWMPJPE75POTTER
1 Image, 2*2 Stitchi3DPWMPVPE87.4POTTER
1 Image, 2*2 Stitchi3DPWPA-MPJPE44.8POTTER

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17MetricHMR: Metric Human Mesh Recovery from Monocular Images2025-06-11Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation2025-05-16ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization2025-05-15