
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Cross-View Tracking for Multi-Human 3D Pose Estimation at over 100 FPS

Long Chen, Haizhou Ai, Rui Chen, Zijie Zhuang, Shuang Liu

Published: 2020-03-09 | Venue: CVPR 2020 | Tasks: Pose Estimation, 3D Pose Estimation, 3D Multi-Person Pose Estimation

Abstract

Estimating 3D poses of multiple humans in real time is a classic but still challenging task in computer vision. Its major difficulty lies in the ambiguity of cross-view association of 2D poses and the huge state space when there are multiple people in multiple views. In this paper, we present a novel solution for multi-human 3D pose estimation from multiple calibrated camera views. It takes 2D poses in different camera coordinates as inputs and aims for accurate 3D poses in the global coordinate system. Unlike previous methods that associate 2D poses among all pairs of views from scratch at every frame, we exploit the temporal consistency in videos to match the 2D inputs with 3D poses directly in 3-space. More specifically, we propose to retain a 3D pose for each person and update it iteratively via cross-view multi-human tracking. This novel formulation improves both accuracy and efficiency, as we demonstrate on widely used public datasets. To further verify the scalability of our method, we propose a new large-scale multi-human dataset with 12 to 28 camera views. Without bells and whistles, our solution achieves 154 FPS on 12 cameras and 34 FPS on 28 cameras, indicating its ability to handle large-scale real-world applications. The proposed dataset is released at https://github.com/longcw/crossview_3d_pose_tracking.
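
The idea of matching 2D inputs against retained 3D poses can be illustrated with a small sketch. This is not the authors' implementation: it assumes a pinhole camera given as a 3x4 projection matrix, uses mean reprojection distance as the matching cost, and swaps in a simple greedy assignment. The names `project`, `reprojection_cost`, `match_view`, and the `max_cost` threshold are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Sequence, Tuple

Point3 = Tuple[float, float, float]
Point2 = Tuple[float, float]
Matrix34 = Sequence[Sequence[float]]  # 3 rows of 4 floats (assumed camera format)


def project(P: Matrix34, X: Point3) -> Point2:
    """Project a 3D point into the image with a 3x4 projection matrix."""
    xh = (X[0], X[1], X[2], 1.0)
    u, v, w = (sum(P[r][c] * xh[c] for c in range(4)) for r in range(3))
    return (u / w, v / w)


def reprojection_cost(P: Matrix34, pose3d: List[Point3], pose2d: List[Point2]) -> float:
    """Mean pixel distance between a projected 3D pose and a 2D pose."""
    total = 0.0
    for X, x in zip(pose3d, pose2d):
        u, v = project(P, X)
        total += ((u - x[0]) ** 2 + (v - x[1]) ** 2) ** 0.5
    return total / len(pose3d)


def match_view(
    P: Matrix34,
    tracks: Dict[int, List[Point3]],
    detections: List[List[Point2]],
    max_cost: float,
) -> Dict[int, int]:
    """Greedily map track id -> detection index for one camera view.

    Pairs are visited from lowest to highest cost; a pair is accepted only
    if neither its track nor its detection has been used already.
    """
    pairs = sorted(
        (reprojection_cost(P, pose3d, det), tid, j)
        for tid, pose3d in tracks.items()
        for j, det in enumerate(detections)
    )
    matches: Dict[int, int] = {}
    used = set()
    for cost, tid, j in pairs:
        if cost > max_cost:
            break  # all remaining pairs are even more expensive
        if tid in matches or j in used:
            continue
        matches[tid] = j
        used.add(j)
    return matches


if __name__ == "__main__":
    # Toy camera looking down the z axis; two tracked people, two joints each.
    P = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0]]
    tracks = {7: [(0, 0, 2), (1, 0, 2)], 9: [(4, 0, 2), (5, 0, 2)]}
    detections = [[(2.0, 0.1), (2.5, 0.0)], [(0.0, 0.0), (0.5, 0.1)]]
    print(match_view(P, tracks, detections, max_cost=0.5))
```

Because each view is matched against a small set of existing 3D poses rather than against every other view, the work per frame in this sketch grows with the number of views instead of the number of view pairs, which is the efficiency argument the abstract makes. Detections left unmatched would, in a full system, be candidates for starting new tracks; that step is omitted here.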

Results

Task | Dataset | Metric | Value | Model
3D Human Pose Estimation | Shelf | PCP3D | 96.8 | crossview_3d_pose_tracking
3D Human Pose Estimation | Shelf | PCP3D | 96.8 | Cross-View
3D Human Pose Estimation | Campus | PCP3D | 96.6 | crossview_3d_pose_tracking
3D Human Pose Estimation | Campus | PCP3D | 96.6 | Cross-View
Pose Estimation | Shelf | PCP3D | 96.8 | crossview_3d_pose_tracking
Pose Estimation | Shelf | PCP3D | 96.8 | Cross-View
Pose Estimation | Campus | PCP3D | 96.6 | crossview_3d_pose_tracking
Pose Estimation | Campus | PCP3D | 96.6 | Cross-View
3D | Shelf | PCP3D | 96.8 | crossview_3d_pose_tracking
3D | Shelf | PCP3D | 96.8 | Cross-View
3D | Campus | PCP3D | 96.6 | crossview_3d_pose_tracking
3D | Campus | PCP3D | 96.6 | Cross-View
3D Multi-Person Pose Estimation | Shelf | PCP3D | 96.8 | crossview_3d_pose_tracking
3D Multi-Person Pose Estimation | Shelf | PCP3D | 96.8 | Cross-View
3D Multi-Person Pose Estimation | Campus | PCP3D | 96.6 | crossview_3d_pose_tracking
3D Multi-Person Pose Estimation | Campus | PCP3D | 96.6 | Cross-View
1 Image, 2*2 Stitchi | Shelf | PCP3D | 96.8 | crossview_3d_pose_tracking
1 Image, 2*2 Stitchi | Shelf | PCP3D | 96.8 | Cross-View
1 Image, 2*2 Stitchi | Campus | PCP3D | 96.6 | crossview_3d_pose_tracking
1 Image, 2*2 Stitchi | Campus | PCP3D | 96.6 | Cross-View
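
For readers unfamiliar with the PCP3D metric reported above, here is a minimal sketch of how a percentage of correct parts is commonly computed in 3D. It assumes the widely used criterion that a limb counts as correct when the average error of its two endpoints is at most half the ground-truth limb length. This page does not state the exact evaluation protocol behind the numbers, so the function name `pcp3d`, the `alpha` parameter, and the toy skeleton are assumptions made for illustration only.

```python
import math
from typing import List, Tuple

Point3 = Tuple[float, float, float]


def pcp3d(
    pred: List[Point3],
    gt: List[Point3],
    limbs: List[Tuple[int, int]],
    alpha: float = 0.5,
) -> float:
    """Percentage of limbs whose mean endpoint error is within alpha * limb length."""
    correct = 0
    for a, b in limbs:
        # Average distance of the two predicted endpoints from ground truth.
        err = (math.dist(pred[a], gt[a]) + math.dist(pred[b], gt[b])) / 2
        if err <= alpha * math.dist(gt[a], gt[b]):
            correct += 1
    return 100.0 * correct / len(limbs)


if __name__ == "__main__":
    # Toy three-joint chain with two limbs of length 1.
    gt = [(0, 0, 0), (0, 0, 1), (0, 0, 2)]
    pred = [(0, 0, 0), (0, 0, 1.2), (0, 2, 2)]
    limbs = [(0, 1), (1, 2)]
    print(pcp3d(pred, gt, limbs))  # first limb correct, second not -> 50.0
```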

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)
DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model (2025-07-17)
From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation (2025-07-17)
AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability (2025-07-17)
SpatialTrackerV2: 3D Point Tracking Made Easy (2025-07-16)
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation (2025-07-16)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)