TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cascade R-CNN: High Quality Object Detection and Instance ...

Cascade R-CNN: High Quality Object Detection and Instance Segmentation

Zhaowei Cai, Nuno Vasconcelos

2019-06-24Vocal Bursts Intensity PredictionSemantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCodeCode(official)Code(official)Code

Abstract

In object detection, the intersection over union (IoU) threshold is frequently used to define positives/negatives. The threshold used to train a detector defines its \textit{quality}. While the commonly used threshold of 0.5 leads to noisy (low-quality) detections, detection performance frequently degrades for larger thresholds. This paradox of high-quality detection has two causes: 1) overfitting, due to vanishing positive samples for large thresholds, and 2) inference-time quality mismatch between detector and test hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, composed of a sequence of detectors trained with increasing IoU thresholds, is proposed to address these problems. The detectors are trained sequentially, using the output of a detector as training set for the next. This resampling progressively improves hypotheses quality, guaranteeing a positive training set of equivalent size for all detectors and minimizing overfitting. The same cascade is applied at inference, to eliminate quality mismatches between hypotheses and detectors. An implementation of the Cascade R-CNN without bells or whistles achieves state-of-the-art performance on the COCO dataset, and significantly improves high-quality detection on generic and specific object detection datasets, including VOC, KITTI, CityPerson, and WiderFace. Finally, the Cascade R-CNN is generalized to instance segmentation, with nontrivial improvements over the Mask R-CNN. To facilitate future research, two implementations are made available at \url{https://github.com/zhaoweicai/cascade-rcnn} (Caffe) and \url{https://github.com/zhaoweicai/Detectron-Cascade-RCNN} (Detectron).

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5062.1Cascade R-CNN
Object DetectionCOCO test-devAP7546.3Cascade R-CNN
Object DetectionCOCO test-devAPL55.2Cascade R-CNN
Object DetectionCOCO test-devAPM45.5Cascade R-CNN
Object DetectionCOCO test-devAPS23.7Cascade R-CNN
Object DetectionCOCO test-devbox mAP42.8Cascade R-CNN
Object DetectionCOCO-OAverage mAP18.2Cascade R-CNN (ResNet-50)
Object DetectionCOCO-OEffective Robustness0.02Cascade R-CNN (ResNet-50)
3DCOCO test-devAP5062.1Cascade R-CNN
3DCOCO test-devAP7546.3Cascade R-CNN
3DCOCO test-devAPL55.2Cascade R-CNN
3DCOCO test-devAPM45.5Cascade R-CNN
3DCOCO test-devAPS23.7Cascade R-CNN
3DCOCO test-devbox mAP42.8Cascade R-CNN
3DCOCO-OAverage mAP18.2Cascade R-CNN (ResNet-50)
3DCOCO-OEffective Robustness0.02Cascade R-CNN (ResNet-50)
Instance SegmentationBDD100K valAP19.8Cascade Mask R-CNN
2D ClassificationCOCO test-devAP5062.1Cascade R-CNN
2D ClassificationCOCO test-devAP7546.3Cascade R-CNN
2D ClassificationCOCO test-devAPL55.2Cascade R-CNN
2D ClassificationCOCO test-devAPM45.5Cascade R-CNN
2D ClassificationCOCO test-devAPS23.7Cascade R-CNN
2D ClassificationCOCO test-devbox mAP42.8Cascade R-CNN
2D ClassificationCOCO-OAverage mAP18.2Cascade R-CNN (ResNet-50)
2D ClassificationCOCO-OEffective Robustness0.02Cascade R-CNN (ResNet-50)
2D Object DetectionCOCO test-devAP5062.1Cascade R-CNN
2D Object DetectionCOCO test-devAP7546.3Cascade R-CNN
2D Object DetectionCOCO test-devAPL55.2Cascade R-CNN
2D Object DetectionCOCO test-devAPM45.5Cascade R-CNN
2D Object DetectionCOCO test-devAPS23.7Cascade R-CNN
2D Object DetectionCOCO test-devbox mAP42.8Cascade R-CNN
2D Object DetectionCOCO-OAverage mAP18.2Cascade R-CNN (ResNet-50)
2D Object DetectionCOCO-OEffective Robustness0.02Cascade R-CNN (ResNet-50)
16kCOCO test-devAP5062.1Cascade R-CNN
16kCOCO test-devAP7546.3Cascade R-CNN
16kCOCO test-devAPL55.2Cascade R-CNN
16kCOCO test-devAPM45.5Cascade R-CNN
16kCOCO test-devAPS23.7Cascade R-CNN
16kCOCO test-devbox mAP42.8Cascade R-CNN
16kCOCO-OAverage mAP18.2Cascade R-CNN (ResNet-50)
16kCOCO-OEffective Robustness0.02Cascade R-CNN (ResNet-50)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17