TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/3D Human Shape and Pose from a Single Low-Resolution Image...

3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

Xiangyu Xu, Hao Chen, Francesc Moreno-Noguer, Laszlo A. Jeni, Fernando de la Torre

2020-07-27ECCV 2020 83D Human Pose EstimationSuper-Resolution3D Shape ReconstructionSelf-Supervised LearningPose EstimationContrastive LearningActivity Recognition
PaperPDFCodeCode(official)

Abstract

3D human shape and pose estimation from monocular images has been an active area of research in computer vision, having a substantial impact on the development of new applications, from activity recognition to creating virtual avatars. Existing deep learning methods for 3D human shape and pose estimation rely on relatively high-resolution input images; however, high-resolution visual content is not always available in several practical scenarios such as video surveillance and sports broadcasting. Low-resolution images in real scenarios can vary in a wide range of sizes, and a model trained in one resolution does not typically degrade gracefully across resolutions. Two common approaches to solve the problem of low-resolution input are applying super-resolution techniques to the input images which may result in visual artifacts, or simply training one model for each resolution, which is impractical in many realistic applications. To address the above issues, this paper proposes a novel algorithm called RSC-Net, which consists of a Resolution-aware network, a Self-supervision loss, and a Contrastive learning scheme. The proposed network is able to learn the 3D body shape and pose across different resolutions with a single model. The self-supervision loss encourages scale-consistency of the output, and the contrastive learning scheme enforces scale-consistency of the deep features. We show that both these new training losses provide robustness when learning 3D shape and pose in a weakly-supervised manner. Extensive experiments demonstrate that the RSC-Net can achieve consistently better results than the state-of-the-art methods for challenging low-resolution images.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPMPJPE103.36RSC-Net
3D Human Pose EstimationMPI-INF-3DHPPA-MPJPE70.01RSC-Net
3D Human Pose Estimation3DPWMPJPE96.36RSC-Net
3D Human Pose Estimation3DPWPA-MPJPE58.98RSC-Net
Pose EstimationMPI-INF-3DHPMPJPE103.36RSC-Net
Pose EstimationMPI-INF-3DHPPA-MPJPE70.01RSC-Net
Pose Estimation3DPWMPJPE96.36RSC-Net
Pose Estimation3DPWPA-MPJPE58.98RSC-Net
3DMPI-INF-3DHPMPJPE103.36RSC-Net
3DMPI-INF-3DHPPA-MPJPE70.01RSC-Net
3D3DPWMPJPE96.36RSC-Net
3D3DPWPA-MPJPE58.98RSC-Net
1 Image, 2*2 StitchiMPI-INF-3DHPMPJPE103.36RSC-Net
1 Image, 2*2 StitchiMPI-INF-3DHPPA-MPJPE70.01RSC-Net
1 Image, 2*2 Stitchi3DPWMPJPE96.36RSC-Net
1 Image, 2*2 Stitchi3DPWPA-MPJPE58.98RSC-Net

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17