TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Objects as Points

Objects as Points

Xingyi Zhou, Dequan Wang, Philipp Krähenbühl

2019-04-16Real-Time Object DetectionOne-stage Anchor-free Oriented Object DetectionKeypoint Detection3D Object DetectionObject DetectionKeypoint Estimation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCode

Abstract

Detection identifies objects as axis-aligned boxes in an image. Most successful object detectors enumerate a nearly exhaustive list of potential object locations and classify each. This is wasteful, inefficient, and requires additional post-processing. In this paper, we take a different approach. We model an object as a single point --- the center point of its bounding box. Our detector uses keypoint estimation to find center points and regresses to all other object properties, such as size, 3D location, orientation, and even pose. Our center point based approach, CenterNet, is end-to-end differentiable, simpler, faster, and more accurate than corresponding bounding box based detectors. CenterNet achieves the best speed-accuracy trade-off on the MS COCO dataset, with 28.1% AP at 142 FPS, 37.4% AP at 52 FPS, and 45.1% AP with multi-scale testing at 1.4 FPS. We use the same approach to estimate 3D bounding box in the KITTI benchmark and human pose on the COCO keypoint dataset. Our method performs competitively with sophisticated multi-stage methods and runs in real-time.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAPL56CenterNet-DLA (DLA-34, multi-scale)
Object DetectionCOCO test-devAPM43.9CenterNet-DLA (DLA-34, multi-scale)
Object DetectionCOCO test-devAPS21.5CenterNet-DLA (DLA-34, multi-scale)
Object DetectionCOCO test-devbox mAP41.6CenterNet-DLA (DLA-34, multi-scale)
Object DetectionUA-DETRACmAP83.48CenterNet
3DCOCO test-devAPL56CenterNet-DLA (DLA-34, multi-scale)
3DCOCO test-devAPM43.9CenterNet-DLA (DLA-34, multi-scale)
3DCOCO test-devAPS21.5CenterNet-DLA (DLA-34, multi-scale)
3DCOCO test-devbox mAP41.6CenterNet-DLA (DLA-34, multi-scale)
3DUA-DETRACmAP83.48CenterNet
2D ClassificationCOCO test-devAPL56CenterNet-DLA (DLA-34, multi-scale)
2D ClassificationCOCO test-devAPM43.9CenterNet-DLA (DLA-34, multi-scale)
2D ClassificationCOCO test-devAPS21.5CenterNet-DLA (DLA-34, multi-scale)
2D ClassificationCOCO test-devbox mAP41.6CenterNet-DLA (DLA-34, multi-scale)
2D ClassificationUA-DETRACmAP83.48CenterNet
2D Object DetectionCOCO test-devAPL56CenterNet-DLA (DLA-34, multi-scale)
2D Object DetectionCOCO test-devAPM43.9CenterNet-DLA (DLA-34, multi-scale)
2D Object DetectionCOCO test-devAPS21.5CenterNet-DLA (DLA-34, multi-scale)
2D Object DetectionCOCO test-devbox mAP41.6CenterNet-DLA (DLA-34, multi-scale)
2D Object DetectionUA-DETRACmAP83.48CenterNet
16kCOCO test-devAPL56CenterNet-DLA (DLA-34, multi-scale)
16kCOCO test-devAPM43.9CenterNet-DLA (DLA-34, multi-scale)
16kCOCO test-devAPS21.5CenterNet-DLA (DLA-34, multi-scale)
16kCOCO test-devbox mAP41.6CenterNet-DLA (DLA-34, multi-scale)
16kUA-DETRACmAP83.48CenterNet

Related Papers

Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16KptLLM++: Towards Generic Keypoint Comprehension with Large Language Model2025-07-15GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft2025-07-15Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15