TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cascade R-CNN: Delving into High Quality Object Detection

Cascade R-CNN: Delving into High Quality Object Detection

Zhaowei Cai, Nuno Vasconcelos

2017-12-03CVPR 2018 6Vocal Bursts Intensity Prediction2D Object DetectionObject Detection
PaperPDFCodeCodeCodeCode(official)CodeCodeCodeCode

Abstract

In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usually produces noisy detections. However, detection performance tends to degrade with increasing the IoU thresholds. Two main factors are responsible for this: 1) overfitting during training, due to exponentially vanishing positive samples, and 2) inference-time mismatch between the IoUs for which the detector is optimal and those of the input hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, is proposed to address these problems. It consists of a sequence of detectors trained with increasing IoU thresholds, to be sequentially more selective against close false positives. The detectors are trained stage by stage, leveraging the observation that the output of a detector is a good distribution for training the next higher quality detector. The resampling of progressively improved hypotheses guarantees that all detectors have a positive set of examples of equivalent size, reducing the overfitting problem. The same cascade procedure is applied at inference, enabling a closer match between the hypotheses and the detector quality of each stage. A simple implementation of the Cascade R-CNN is shown to surpass all single-model object detectors on the challenging COCO dataset. Experiments also show that the Cascade R-CNN is widely applicable across detector architectures, achieving consistent gains independently of the baseline detector strength. The code will be made available at https://github.com/zhaoweicai/cascade-rcnn.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5062.1Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devAP7546.3Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devAPL55.2Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devAPM45.5Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devAPS23.7Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devbox mAP42.8Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO test-devAP5059.9Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devAP7544Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devAPL52.1Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devAPM42.7Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devAPS22.6Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devbox mAP40.6Cascade R-CNN (ResNet-50-FPN+, cascade)
Object DetectionCOCO test-devAP5061.1Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devAP7541.9Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devAPL49.8Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devAPM41.8Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devAPS21.3Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devbox mAP38.8Cascade R-CNN (ResNet-101-FPN+)
Object DetectionCOCO test-devAP5059Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO test-devAP7539.2Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO test-devAPL46.4Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO test-devAPM38.8Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO test-devAPS20.3Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO test-devbox mAP36.5Cascade R-CNN (ResNet-50-FPN+)
Object DetectionAI-TODAP13.8Cascade R-CNN (ResNet-50-FPN)
Object DetectionAI-TODAP5030.8Cascade R-CNN (ResNet-50-FPN)
Object DetectionAI-TODAP7510.5Cascade R-CNN (ResNet-50-FPN)
Object DetectionAI-TODAPm26.6Cascade R-CNN (ResNet-50-FPN)
Object DetectionAI-TODAPs25.5Cascade R-CNN (ResNet-50-FPN)
Object DetectionAI-TODAPt10.6Cascade R-CNN (ResNet-50-FPN)
Object DetectionCOCO minivalAP5061.6Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalAP7546.6Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalAPL57.4Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalAPM46.2Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalAPS23.8Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalbox AP42.7Cascade R-CNN (ResNet-101-FPN+, cascade)
Object DetectionCOCO minivalAP5059.4Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO minivalAP7543.7Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO minivalAPL54.1Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO minivalAPM43.7Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO minivalAPS22.9Cascade R-CNN (ResNet-50-FPN+)
Object DetectionCOCO minivalbox AP40.3Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devAP5062.1Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devAP7546.3Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devAPL55.2Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devAPM45.5Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devAPS23.7Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devbox mAP42.8Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO test-devAP5059.9Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devAP7544Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devAPL52.1Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devAPM42.7Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devAPS22.6Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devbox mAP40.6Cascade R-CNN (ResNet-50-FPN+, cascade)
3DCOCO test-devAP5061.1Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devAP7541.9Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devAPL49.8Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devAPM41.8Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devAPS21.3Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devbox mAP38.8Cascade R-CNN (ResNet-101-FPN+)
3DCOCO test-devAP5059Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devAP7539.2Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devAPL46.4Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devAPM38.8Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devAPS20.3Cascade R-CNN (ResNet-50-FPN+)
3DCOCO test-devbox mAP36.5Cascade R-CNN (ResNet-50-FPN+)
3DAI-TODAP13.8Cascade R-CNN (ResNet-50-FPN)
3DAI-TODAP5030.8Cascade R-CNN (ResNet-50-FPN)
3DAI-TODAP7510.5Cascade R-CNN (ResNet-50-FPN)
3DAI-TODAPm26.6Cascade R-CNN (ResNet-50-FPN)
3DAI-TODAPs25.5Cascade R-CNN (ResNet-50-FPN)
3DAI-TODAPt10.6Cascade R-CNN (ResNet-50-FPN)
3DCOCO minivalAP5061.6Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalAP7546.6Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalAPL57.4Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalAPM46.2Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalAPS23.8Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalbox AP42.7Cascade R-CNN (ResNet-101-FPN+, cascade)
3DCOCO minivalAP5059.4Cascade R-CNN (ResNet-50-FPN+)
3DCOCO minivalAP7543.7Cascade R-CNN (ResNet-50-FPN+)
3DCOCO minivalAPL54.1Cascade R-CNN (ResNet-50-FPN+)
3DCOCO minivalAPM43.7Cascade R-CNN (ResNet-50-FPN+)
3DCOCO minivalAPS22.9Cascade R-CNN (ResNet-50-FPN+)
3DCOCO minivalbox AP40.3Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devAP5062.1Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devAP7546.3Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devAPL55.2Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devAPM45.5Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devAPS23.7Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devbox mAP42.8Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO test-devAP5059.9Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devAP7544Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devAPL52.1Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devAPM42.7Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devAPS22.6Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devbox mAP40.6Cascade R-CNN (ResNet-50-FPN+, cascade)
2D ClassificationCOCO test-devAP5061.1Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devAP7541.9Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devAPL49.8Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devAPM41.8Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devAPS21.3Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devbox mAP38.8Cascade R-CNN (ResNet-101-FPN+)
2D ClassificationCOCO test-devAP5059Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devAP7539.2Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devAPL46.4Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devAPM38.8Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devAPS20.3Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO test-devbox mAP36.5Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationAI-TODAP13.8Cascade R-CNN (ResNet-50-FPN)
2D ClassificationAI-TODAP5030.8Cascade R-CNN (ResNet-50-FPN)
2D ClassificationAI-TODAP7510.5Cascade R-CNN (ResNet-50-FPN)
2D ClassificationAI-TODAPm26.6Cascade R-CNN (ResNet-50-FPN)
2D ClassificationAI-TODAPs25.5Cascade R-CNN (ResNet-50-FPN)
2D ClassificationAI-TODAPt10.6Cascade R-CNN (ResNet-50-FPN)
2D ClassificationCOCO minivalAP5061.6Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalAP7546.6Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalAPL57.4Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalAPM46.2Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalAPS23.8Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalbox AP42.7Cascade R-CNN (ResNet-101-FPN+, cascade)
2D ClassificationCOCO minivalAP5059.4Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO minivalAP7543.7Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO minivalAPL54.1Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO minivalAPM43.7Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO minivalAPS22.9Cascade R-CNN (ResNet-50-FPN+)
2D ClassificationCOCO minivalbox AP40.3Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionSARDet-100Kbox mAP51.1Cascade R-CNN
2D Object DetectionCOCO test-devAP5062.1Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devAP7546.3Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devAPL55.2Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devAPM45.5Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devAPS23.7Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devbox mAP42.8Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO test-devAP5059.9Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devAP7544Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devAPL52.1Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devAPM42.7Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devAPS22.6Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devbox mAP40.6Cascade R-CNN (ResNet-50-FPN+, cascade)
2D Object DetectionCOCO test-devAP5061.1Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devAP7541.9Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devAPL49.8Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devAPM41.8Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devAPS21.3Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devbox mAP38.8Cascade R-CNN (ResNet-101-FPN+)
2D Object DetectionCOCO test-devAP5059Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO test-devAP7539.2Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO test-devAPL46.4Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO test-devAPM38.8Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO test-devAPS20.3Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO test-devbox mAP36.5Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionAI-TODAP13.8Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionAI-TODAP5030.8Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionAI-TODAP7510.5Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionAI-TODAPm26.6Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionAI-TODAPs25.5Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionAI-TODAPt10.6Cascade R-CNN (ResNet-50-FPN)
2D Object DetectionCOCO minivalAP5061.6Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalAP7546.6Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalAPL57.4Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalAPM46.2Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalAPS23.8Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalbox AP42.7Cascade R-CNN (ResNet-101-FPN+, cascade)
2D Object DetectionCOCO minivalAP5059.4Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO minivalAP7543.7Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO minivalAPL54.1Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO minivalAPM43.7Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO minivalAPS22.9Cascade R-CNN (ResNet-50-FPN+)
2D Object DetectionCOCO minivalbox AP40.3Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devAP5062.1Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devAP7546.3Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devAPL55.2Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devAPM45.5Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devAPS23.7Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devbox mAP42.8Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO test-devAP5059.9Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devAP7544Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devAPL52.1Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devAPM42.7Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devAPS22.6Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devbox mAP40.6Cascade R-CNN (ResNet-50-FPN+, cascade)
16kCOCO test-devAP5061.1Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devAP7541.9Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devAPL49.8Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devAPM41.8Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devAPS21.3Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devbox mAP38.8Cascade R-CNN (ResNet-101-FPN+)
16kCOCO test-devAP5059Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devAP7539.2Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devAPL46.4Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devAPM38.8Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devAPS20.3Cascade R-CNN (ResNet-50-FPN+)
16kCOCO test-devbox mAP36.5Cascade R-CNN (ResNet-50-FPN+)
16kAI-TODAP13.8Cascade R-CNN (ResNet-50-FPN)
16kAI-TODAP5030.8Cascade R-CNN (ResNet-50-FPN)
16kAI-TODAP7510.5Cascade R-CNN (ResNet-50-FPN)
16kAI-TODAPm26.6Cascade R-CNN (ResNet-50-FPN)
16kAI-TODAPs25.5Cascade R-CNN (ResNet-50-FPN)
16kAI-TODAPt10.6Cascade R-CNN (ResNet-50-FPN)
16kCOCO minivalAP5061.6Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalAP7546.6Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalAPL57.4Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalAPM46.2Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalAPS23.8Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalbox AP42.7Cascade R-CNN (ResNet-101-FPN+, cascade)
16kCOCO minivalAP5059.4Cascade R-CNN (ResNet-50-FPN+)
16kCOCO minivalAP7543.7Cascade R-CNN (ResNet-50-FPN+)
16kCOCO minivalAPL54.1Cascade R-CNN (ResNet-50-FPN+)
16kCOCO minivalAPM43.7Cascade R-CNN (ResNet-50-FPN+)
16kCOCO minivalAPS22.9Cascade R-CNN (ResNet-50-FPN+)
16kCOCO minivalbox AP40.3Cascade R-CNN (ResNet-50-FPN+)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07