TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TRAM: Global Trajectory and Motion of 3D Humans from in-th...

TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos

Yufu Wang, ZiYun Wang, Lingjie Liu, Kostas Daniilidis

2024-03-263D Human Pose Estimation
PaperPDFCode(official)

Abstract

We propose TRAM, a two-stage method to reconstruct a human's global trajectory and motion from in-the-wild videos. TRAM robustifies SLAM to recover the camera motion in the presence of dynamic humans and uses the scene background to derive the motion scale. Using the recovered camera as a metric-scale reference frame, we introduce a video transformer model (VIMO) to regress the kinematic body motion of a human. By composing the two motions, we achieve accurate recovery of 3D humans in the world space, reducing global motion errors by a large margin from prior work. https://yufu-wang.github.io/tram4d/

Results

TaskDatasetMetricValueModel
3D Human Pose EstimationEMDBAverage MPJPE (mm)74.4TRAM
3D Human Pose EstimationEMDBAverage MPJPE-PA (mm)45.7TRAM
3D Human Pose EstimationEMDBAverage MVE (mm)86.6TRAM
Pose EstimationEMDBAverage MPJPE (mm)74.4TRAM
Pose EstimationEMDBAverage MPJPE-PA (mm)45.7TRAM
Pose EstimationEMDBAverage MVE (mm)86.6TRAM
3DEMDBAverage MPJPE (mm)74.4TRAM
3DEMDBAverage MPJPE-PA (mm)45.7TRAM
3DEMDBAverage MVE (mm)86.6TRAM
1 Image, 2*2 StitchiEMDBAverage MPJPE (mm)74.4TRAM
1 Image, 2*2 StitchiEMDBAverage MPJPE-PA (mm)45.7TRAM
1 Image, 2*2 StitchiEMDBAverage MVE (mm)86.6TRAM

Related Papers

Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images2025-06-24ExtPose: Robust and Coherent Pose Estimation by Extending ViTs2025-06-18PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation2025-06-17Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation2025-06-03UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction2025-05-20PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation2025-05-16HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation2025-05-07Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation2025-05-04