TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/OpenPifPaf: Composite Fields for Semantic Keypoint Detecti...

OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association

Sven Kreiss, Lorenzo Bertoni, Alexandre Alahi

2021-03-03Car Pose EstimationPose EstimationMulti-Person Pose EstimationKeypoint DetectionSelf-Driving Cars
PaperPDFCode(official)CodeCodeCode(official)CodeCode

Abstract

Many image-based perception tasks can be formulated as detecting, associating and tracking semantic keypoints, e.g., human body pose estimation and tracking. In this work, we present a general framework that jointly detects and forms spatio-temporal keypoint associations in a single stage, making this the first real-time pose detection and tracking algorithm. We present a generic neural network architecture that uses Composite Fields to detect and construct a spatio-temporal pose which is a single, connected graph whose nodes are the semantic keypoints (e.g., a person's body joints) in multiple frames. For the temporal associations, we introduce the Temporal Composite Association Field (TCAF) which requires an extended network architecture and training method beyond previous Composite Fields. Our experiments show competitive accuracy while being an order of magnitude faster on multiple publicly available datasets such as COCO, CrowdPose and the PoseTrack 2017 and 2018 datasets. We also show that our method generalizes to any class of semantic keypoints such as car and animal parts to provide a holistic perception framework that is well suited for urban mobility such as self-driving cars and delivery robots.

Results

TaskDatasetMetricValueModel
Pose EstimationCrowdPoseAP70.5OpenPifPaf
Pose EstimationCrowdPoseAP Easy78.4OpenPifPaf
Pose EstimationCrowdPoseAP Hard63.8OpenPifPaf
Pose EstimationCrowdPoseAP Medium72.1OpenPifPaf
Pose EstimationCrowdPoseAP5089.1OpenPifPaf
Pose EstimationCrowdPoseAP7576.1OpenPifPaf
Pose EstimationCOCO test-devAP70.9OpenPifPaf
Pose EstimationCOCO test-devAPL76.8OpenPifPaf
Pose EstimationCOCO test-devAPM67.1OpenPifPaf
Pose EstimationCOCO (Common Objects in Context)AP0.709OpenPifPaf
Pose EstimationCOCO (Common Objects in Context)Test AP70.9OpenPifPaf
Pose EstimationCOCO (Common Objects in Context)Validation AP71OpenPifPaf
Pose EstimationApolloCar3DDetection Rate86.1OpenPifPaf
3DCrowdPoseAP70.5OpenPifPaf
3DCrowdPoseAP Easy78.4OpenPifPaf
3DCrowdPoseAP Hard63.8OpenPifPaf
3DCrowdPoseAP Medium72.1OpenPifPaf
3DCrowdPoseAP5089.1OpenPifPaf
3DCrowdPoseAP7576.1OpenPifPaf
3DCOCO test-devAP70.9OpenPifPaf
3DCOCO test-devAPL76.8OpenPifPaf
3DCOCO test-devAPM67.1OpenPifPaf
3DCOCO (Common Objects in Context)AP0.709OpenPifPaf
3DCOCO (Common Objects in Context)Test AP70.9OpenPifPaf
3DCOCO (Common Objects in Context)Validation AP71OpenPifPaf
3DApolloCar3DDetection Rate86.1OpenPifPaf
Multi-Person Pose EstimationCOCO (Common Objects in Context)AP0.709OpenPifPaf
Multi-Person Pose EstimationCOCO (Common Objects in Context)Test AP70.9OpenPifPaf
Multi-Person Pose EstimationCOCO (Common Objects in Context)Validation AP71OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP70.5OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP Easy78.4OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP Hard63.8OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP Medium72.1OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP5089.1OpenPifPaf
1 Image, 2*2 StitchiCrowdPoseAP7576.1OpenPifPaf
1 Image, 2*2 StitchiCOCO test-devAP70.9OpenPifPaf
1 Image, 2*2 StitchiCOCO test-devAPL76.8OpenPifPaf
1 Image, 2*2 StitchiCOCO test-devAPM67.1OpenPifPaf
1 Image, 2*2 StitchiCOCO (Common Objects in Context)AP0.709OpenPifPaf
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP70.9OpenPifPaf
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Validation AP71OpenPifPaf
1 Image, 2*2 StitchiApolloCar3DDetection Rate86.1OpenPifPaf

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16