TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sparse R-CNN: End-to-End Object Detection with Learnable P...

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei LI, Zehuan Yuan, Changhu Wang, Ping Luo

2020-11-25CVPR 2021 1Object Recognition2D Object Detectionobject-detectionObject Detection
PaperPDFCodeCodeCodeCode(official)CodeCode

Abstract

We present Sparse R-CNN, a purely sparse method for object detection in images. Existing works on object detection heavily rely on dense object candidates, such as $k$ anchor boxes pre-defined on all grids of image feature map of size $H\times W$. In our method, however, a fixed sparse set of learned object proposals, total length of $N$, are provided to object recognition head to perform classification and location. By eliminating $HWk$ (up to hundreds of thousands) hand-designed object candidates to $N$ (e.g. 100) learnable proposals, Sparse R-CNN completely avoids all efforts related to object candidates design and many-to-one label assignment. More importantly, final predictions are directly output without non-maximum suppression post-procedure. Sparse R-CNN demonstrates accuracy, run-time and training convergence performance on par with the well-established detector baselines on the challenging COCO dataset, e.g., achieving 45.0 AP in standard $3\times$ training schedule and running at 22 fps using ResNet-50 FPN model. We hope our work could inspire re-thinking the convention of dense prior in object detectors. The code is available at: https://github.com/PeizeSun/SparseR-CNN.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5064.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAP7549.5Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPL61.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPM48.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPS28.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalbox AP45.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAP5063.4Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAP7548.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPL59.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPM47.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAPS26.9Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalbox AP44.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
Object DetectionCOCO minivalAP5062.1Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalAP7547.2Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalAPL59.7Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalAPM46.3Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalAPS26.1Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalbox AP43.5Sparse R-CNN (ResNet-101, FPN)
Object DetectionCOCO minivalAP5061.2Sparse R-CNN (ResNet-50, FPN)
Object DetectionCOCO minivalAP7545.7Sparse R-CNN (ResNet-50, FPN)
Object DetectionCOCO minivalAPL57.6Sparse R-CNN (ResNet-50, FPN)
Object DetectionCOCO minivalAPM44.6Sparse R-CNN (ResNet-50, FPN)
Object DetectionCOCO minivalAPS26.7Sparse R-CNN (ResNet-50, FPN)
Object DetectionCOCO minivalbox AP42.3Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalAP5064.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalAP7549.5Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPL61.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPM48.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPS28.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalbox AP45.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
3DCOCO minivalAP5063.4Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalAP7548.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPL59.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPM47.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalAPS26.9Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalbox AP44.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
3DCOCO minivalAP5062.1Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalAP7547.2Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalAPL59.7Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalAPM46.3Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalAPS26.1Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalbox AP43.5Sparse R-CNN (ResNet-101, FPN)
3DCOCO minivalAP5061.2Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalAP7545.7Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalAPL57.6Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalAPM44.6Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalAPS26.7Sparse R-CNN (ResNet-50, FPN)
3DCOCO minivalbox AP42.3Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalAP5064.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAP7549.5Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPL61.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPM48.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPS28.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalbox AP45.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAP5063.4Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAP7548.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPL59.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPM47.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAPS26.9Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalbox AP44.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D ClassificationCOCO minivalAP5062.1Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalAP7547.2Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalAPL59.7Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalAPM46.3Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalAPS26.1Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalbox AP43.5Sparse R-CNN (ResNet-101, FPN)
2D ClassificationCOCO minivalAP5061.2Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalAP7545.7Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalAPL57.6Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalAPM44.6Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalAPS26.7Sparse R-CNN (ResNet-50, FPN)
2D ClassificationCOCO minivalbox AP42.3Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionSARDet-100Kbox mAP38.1Sparse R-CNN
2D Object DetectionCeyMomAP47.3Sparse R-CNN
2D Object DetectionCOCO minivalAP5064.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAP7549.5Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPL61.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPM48.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPS28.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalbox AP45.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAP5063.4Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAP7548.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPL59.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPM47.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAPS26.9Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalbox AP44.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
2D Object DetectionCOCO minivalAP5062.1Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalAP7547.2Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalAPL59.7Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalAPM46.3Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalAPS26.1Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalbox AP43.5Sparse R-CNN (ResNet-101, FPN)
2D Object DetectionCOCO minivalAP5061.2Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionCOCO minivalAP7545.7Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionCOCO minivalAPL57.6Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionCOCO minivalAPM44.6Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionCOCO minivalAPS26.7Sparse R-CNN (ResNet-50, FPN)
2D Object DetectionCOCO minivalbox AP42.3Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalAP5064.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalAP7549.5Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPL61.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPM48.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPS28.3Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalbox AP45.6Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
16kCOCO minivalAP5063.4Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalAP7548.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPL59.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPM47.2Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalAPS26.9Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalbox AP44.5Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
16kCOCO minivalAP5062.1Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalAP7547.2Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalAPL59.7Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalAPM46.3Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalAPS26.1Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalbox AP43.5Sparse R-CNN (ResNet-101, FPN)
16kCOCO minivalAP5061.2Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalAP7545.7Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalAPL57.6Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalAPM44.6Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalAPS26.7Sparse R-CNN (ResNet-50, FPN)
16kCOCO minivalbox AP42.3Sparse R-CNN (ResNet-50, FPN)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08