TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/Cascade R-CNN

Cascade R-CNN

Computer VisionIntroduced 200034 papers
Source Paper

Description

Cascade R-CNN is an object detection architecture that seeks to address problems with degrading performance with increased IoU thresholds (due to overfitting during training and inference-time mismatch between IoUs for which detector is optimal and the inputs). It is a multi-stage extension of the R-CNN, where detector stages deeper into the cascade are sequentially more selective against close false positives. The cascade of R-CNN stages are trained sequentially, using the output of one stage to train the next. This is motivated by the observation that the output IoU of a regressor is almost invariably better than the input IoU.

Cascade R-CNN does not aim to mine hard negatives. Instead, by adjusting bounding boxes, each stage aims to find a good set of close false positives for training the next stage. When operating in this manner, a sequence of detectors adapted to increasingly higher IoUs can beat the overfitting problem, and thus be effectively trained. At inference, the same cascade procedure is applied. The progressively improved hypotheses are better matched to the increasing detector quality at each stage.

Papers Using This Method

AI-Driven MRI Spine Pathology Detection: A Comprehensive Deep Learning Approach for Automated Diagnosis in Diverse Clinical Settings2025-03-26Enhancing Tree Type Detection in Forest Fire Risk Assessment: Multi-Stage Approach and Color Encoding with Forest Fire Risk Evaluation Framework for UAV Imagery2024-07-27On Feasibility of Intent Obfuscating Attacks2024-07-22FAD-SAR: A Novel Fishing Activity Detection System via Synthetic Aperture Radar Images Based on Deep Learning Method2024-04-28Rethinking Detection Based Table Structure Recognition for Visually Rich Document Images2023-12-01Semi-Supervised and Long-Tailed Object Detection with CascadeMatch2023-05-24Context-Aware Chart Element Detection2023-05-07FQDet: Fast-converging Query-based Detector2022-10-05ComplETR: Reducing the cost of annotations for object detection in dense scenes with vision transformers2022-09-13Lost in Compression: the Impact of Lossy Image Compression on Variable Size Object Detection within Infrared Imagery2022-05-16Self-Normalized Density Map (SNDM) for Counting Microbiological Objects2022-03-15Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery2022-01-18Automatic Detection of Injection and Press Mold Parts on 2D Drawing Using Deep Neural Network2021-10-22Operationalizing Convolutional Neural Network Architectures for Prohibited Object Detection in X-Ray Imagery2021-10-10Instances as Queries2021-05-05OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection2021-03-08Augmenting Proposals by the Detector Itself2021-01-28SyNet: An Ensemble Network for Object Detection in UAV Images2020-12-23Hierarchical Context Embedding for Region-based Object Detection2020-08-04A Solution to Product detection in Densely Packed Scenes2020-07-23