TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PersonLab: Person Pose Estimation and Instance Segmentatio...

PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

George Papandreou, Tyler Zhu, Liang-Chieh Chen, Spyros Gidaris, Jonathan Tompson, Kevin Murphy

2018-03-22ECCV 2018 9Semantic SegmentationPose EstimationMulti-Person Pose EstimationKeypoint DetectionInstance Segmentation
PaperPDFCodeCodeCode

Abstract

We present a box-free bottom-up approach for the tasks of pose estimation and instance segmentation of people in multi-person images using an efficient single-shot model. The proposed PersonLab model tackles both semantic-level reasoning and object-part associations using part-based modeling. Our model employs a convolutional network which learns to detect individual keypoints and predict their relative displacements, allowing us to group keypoints into person pose instances. Further, we propose a part-induced geometric embedding descriptor which allows us to associate semantic person pixels with their corresponding person instance, delivering instance-level person segmentations. Our system is based on a fully-convolutional architecture and allows for efficient inference, with runtime essentially independent of the number of people present in the scene. Trained on COCO data alone, our system achieves COCO test-dev keypoint average precision of 0.665 using single-scale inference and 0.687 using multi-scale inference, significantly outperforming all previous bottom-up pose estimation systems. We are also the first bottom-up method to report competitive results for the person class in the COCO instance segmentation task, achieving a person category average precision of 0.417.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO (Common Objects in Context)Test AP66.5PersonLab
Pose EstimationCOCO test-devAP68.7PersonLab
Pose EstimationCOCO test-devAP5089PersonLab
Pose EstimationCOCO test-devAP7575.4PersonLab
Pose EstimationCOCO test-devAPL75.5PersonLab
Pose EstimationCOCO test-devAPM64.1PersonLab
3DCOCO (Common Objects in Context)Test AP66.5PersonLab
3DCOCO test-devAP68.7PersonLab
3DCOCO test-devAP5089PersonLab
3DCOCO test-devAP7575.4PersonLab
3DCOCO test-devAPL75.5PersonLab
3DCOCO test-devAPM64.1PersonLab
Multi-Person Pose EstimationCOCO test-devAP68.7PersonLab
Multi-Person Pose EstimationCOCO test-devAP5089PersonLab
Multi-Person Pose EstimationCOCO test-devAP7575.4PersonLab
Multi-Person Pose EstimationCOCO test-devAPL75.5PersonLab
Multi-Person Pose EstimationCOCO test-devAPM64.1PersonLab
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP66.5PersonLab
1 Image, 2*2 StitchiCOCO test-devAP68.7PersonLab
1 Image, 2*2 StitchiCOCO test-devAP5089PersonLab
1 Image, 2*2 StitchiCOCO test-devAP7575.4PersonLab
1 Image, 2*2 StitchiCOCO test-devAPL75.5PersonLab
1 Image, 2*2 StitchiCOCO test-devAPM64.1PersonLab

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17