TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Non-Local Latent Relation Distillation for Self-Adaptive 3...

Non-Local Latent Relation Distillation for Self-Adaptive 3D Human Pose Estimation

Jogendra Nath Kundu, Siddharth Seth, Anirudh Jamkhandi, Pradyumna YM, Varun Jampani, Anirban Chakraborty, R. Venkatesh Babu

2022-04-05NeurIPS 2021 123D Human Pose EstimationWeakly-supervised 3D Human Pose EstimationUnsupervised 3D Human Pose EstimationPose Estimation
PaperPDF

Abstract

Available 3D human pose estimation approaches leverage different forms of strong (2D/3D pose) or weak (multi-view or depth) paired supervision. Barring synthetic or in-studio domains, acquiring such supervision for each new target environment is highly inconvenient. To this end, we cast 3D pose learning as a self-supervised adaptation problem that aims to transfer the task knowledge from a labeled source domain to a completely unpaired target. We propose to infer image-to-pose via two explicit mappings viz. image-to-latent and latent-to-pose where the latter is a pre-learned decoder obtained from a prior-enforcing generative adversarial auto-encoder. Next, we introduce relation distillation as a means to align the unpaired cross-modal samples i.e. the unpaired target videos and unpaired 3D pose sequences. To this end, we propose a new set of non-local relations in order to characterize long-range latent pose interactions unlike general contrastive relations where positive couplings are limited to a local neighborhood structure. Further, we provide an objective way to quantify non-localness in order to select the most effective relation set. We evaluate different self-adaptation settings and demonstrate state-of-the-art 3D human pose estimation performance on standard benchmarks.

Results

TaskDatasetMetricValueModel
3D Human Pose Estimation3DPWPA-MPJPE72.1Non-Local Latent Relation Distillation
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)57.6Non-Local Latent Relation Distillation
3D Human Pose EstimationHuman3.6MPA-MPJPE48.2Non-Local Latent Relation Distillation
3D ReconstructionHuman3.6MMPJPE97.8Non-Local Latent Relation Distillation
3D ReconstructionHuman3.6MPA-MPJPE86.2Non-Local Latent Relation Distillation
Pose Estimation3DPWPA-MPJPE72.1Non-Local Latent Relation Distillation
Pose EstimationHuman3.6MAverage MPJPE (mm)57.6Non-Local Latent Relation Distillation
Pose EstimationHuman3.6MPA-MPJPE48.2Non-Local Latent Relation Distillation
3D3DPWPA-MPJPE72.1Non-Local Latent Relation Distillation
3DHuman3.6MAverage MPJPE (mm)57.6Non-Local Latent Relation Distillation
3DHuman3.6MPA-MPJPE48.2Non-Local Latent Relation Distillation
3DHuman3.6MMPJPE97.8Non-Local Latent Relation Distillation
3DHuman3.6MPA-MPJPE86.2Non-Local Latent Relation Distillation
1 Image, 2*2 Stitchi3DPWPA-MPJPE72.1Non-Local Latent Relation Distillation
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)57.6Non-Local Latent Relation Distillation
1 Image, 2*2 StitchiHuman3.6MPA-MPJPE48.2Non-Local Latent Relation Distillation

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16