TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SRNet: Improving Generalization in 3D Human Pose Estimatio...

SRNet: Improving Generalization in 3D Human Pose Estimation with a Split-and-Recombine Approach

Ailing Zeng, Xiao Sun, Fuyang Huang, Minhao Liu, Qiang Xu, Stephen Lin

2020-07-18ECCV 2020 83D Human Pose EstimationMonocular 3D Human Pose EstimationPose Estimation
PaperPDFCode(official)

Abstract

Human poses that are rare or unseen in a training set are challenging for a network to predict. Similar to the long-tailed distribution problem in visual recognition, the small number of examples for such poses limits the ability of networks to model them. Interestingly, local pose distributions suffer less from the long-tail problem, i.e., local joint configurations within a rare pose may appear within other poses in the training set, making them less rare. We propose to take advantage of this fact for better generalization to rare and unseen poses. To be specific, our method splits the body into local regions and processes them in separate network branches, utilizing the property that a joint position depends mainly on the joints within its local body region. Global coherence is maintained by recombining the global context from the rest of the body into each branch as a low-dimensional vector. With the reduced dimensionality of less relevant body areas, the training set distribution within network branches more closely reflects the statistics of local poses instead of global body poses, without sacrificing information important for joint inference. The proposed split-and-recombine approach, called SRNet, can be easily adapted to both single-image and temporal models, and it leads to appreciable improvements in the prediction of rare and unseen poses.

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationMPI-INF-3DHPAUC43.8SRNET
3D Human Pose EstimationMPI-INF-3DHPPCK77.6SRNET
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)44.8SRNet (T=243)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)49.9SRNet (T=1)
3D Human Pose EstimationHuman3.6MAverage MPJPE (mm)49.9SRNET
3D Human Pose EstimationHuman3.6MFrames Needed1SRNET
Pose EstimationMPI-INF-3DHPAUC43.8SRNET
Pose EstimationMPI-INF-3DHPPCK77.6SRNET
Pose EstimationHuman3.6MAverage MPJPE (mm)44.8SRNet (T=243)
Pose EstimationHuman3.6MAverage MPJPE (mm)49.9SRNet (T=1)
Pose EstimationHuman3.6MAverage MPJPE (mm)49.9SRNET
Pose EstimationHuman3.6MFrames Needed1SRNET
3DMPI-INF-3DHPAUC43.8SRNET
3DMPI-INF-3DHPPCK77.6SRNET
3DHuman3.6MAverage MPJPE (mm)44.8SRNet (T=243)
3DHuman3.6MAverage MPJPE (mm)49.9SRNet (T=1)
3DHuman3.6MAverage MPJPE (mm)49.9SRNET
3DHuman3.6MFrames Needed1SRNET
1 Image, 2*2 StitchiMPI-INF-3DHPAUC43.8SRNET
1 Image, 2*2 StitchiMPI-INF-3DHPPCK77.6SRNET
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)44.8SRNet (T=243)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)49.9SRNet (T=1)
1 Image, 2*2 StitchiHuman3.6MAverage MPJPE (mm)49.9SRNET
1 Image, 2*2 StitchiHuman3.6MFrames Needed1SRNET

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16