TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EvoPose2D: Pushing the Boundaries of 2D Human Pose Estimat...

EvoPose2D: Pushing the Boundaries of 2D Human Pose Estimation using Accelerated Neuroevolution with Weight Transfer

William McNally, Kanav Vats, Alexander Wong, John McPhee

2020-11-172D Human Pose EstimationNeural Architecture SearchPose EstimationMulti-Person Pose EstimationKeypoint Detection
PaperPDFCode(official)

Abstract

Neural architecture search has proven to be highly effective in the design of efficient convolutional neural networks that are better suited for mobile deployment than hand-designed networks. Hypothesizing that neural architecture search holds great potential for human pose estimation, we explore the application of neuroevolution, a form of neural architecture search inspired by biological evolution, in the design of 2D human pose networks for the first time. Additionally, we propose a new weight transfer scheme that enables us to accelerate neuroevolution in a flexible manner. Our method produces network designs that are more efficient and more accurate than state-of-the-art hand-designed networks. In fact, the generated networks process images at higher resolutions using less computation than previous hand-designed networks at lower resolutions, allowing us to push the boundaries of 2D human pose estimation. Our base network designed via neuroevolution, which we refer to as EvoPose2D-S, achieves comparable accuracy to SimpleBaseline while being 50% faster and 12.7x smaller in terms of file size. Our largest network, EvoPose2D-L, achieves new state-of-the-art accuracy on the Microsoft COCO Keypoints benchmark, is 4.3x smaller than its nearest competitor, and has similar inference speed. The code is publicly available at https://github.com/wmcnally/evopose2d.

Results

TaskDatasetMetricValueModel
Pose EstimationCOCO test-devAP76.8EvoPose2D-L
Pose EstimationCOCO test-devAP5092.5EvoPose2D-L
Pose EstimationCOCO test-devAP7584.3EvoPose2D-L
Pose EstimationCOCO test-devAPL82.5EvoPose2D-L
Pose EstimationCOCO test-devAPM73.5EvoPose2D-L
Pose EstimationCOCO test-devAR81.7EvoPose2D-L
Pose EstimationCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L(512x384)
Pose EstimationCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L(512x384)
Pose EstimationCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L
Pose EstimationCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L
3DCOCO test-devAP76.8EvoPose2D-L
3DCOCO test-devAP5092.5EvoPose2D-L
3DCOCO test-devAP7584.3EvoPose2D-L
3DCOCO test-devAPL82.5EvoPose2D-L
3DCOCO test-devAPM73.5EvoPose2D-L
3DCOCO test-devAR81.7EvoPose2D-L
3DCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L(512x384)
3DCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L(512x384)
3DCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L
3DCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L
Multi-Person Pose EstimationCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L
Multi-Person Pose EstimationCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAP76.8EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAP5092.5EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAP7584.3EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAPL82.5EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAPM73.5EvoPose2D-L
1 Image, 2*2 StitchiCOCO test-devAR81.7EvoPose2D-L
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L(512x384)
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L(512x384)
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Test AP76.8EvoPose2D-L
1 Image, 2*2 StitchiCOCO (Common Objects in Context)Validation AP77.5EvoPose2D-L

Related Papers

DASViT: Differentiable Architecture Search for Vision Transformer2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16