TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Simple Training Strategies and Model Scaling for Object De...

Simple Training Strategies and Model Scaling for Object Detection

Xianzhi Du, Barret Zoph, Wei-Chih Hung, Tsung-Yi Lin

2021-06-30Semantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCode(official)

Abstract

The speed-accuracy Pareto curve of object detection systems have advanced through a combination of better model architectures, training and inference methods. In this paper, we methodically evaluate a variety of these techniques to understand where most of the improvements in modern detection systems come from. We benchmark these improvements on the vanilla ResNet-FPN backbone with RetinaNet and RCNN detectors. The vanilla detectors are improved by 7.7% in accuracy while being 30% faster in speed. We further provide simple scaling strategies to generate family of models that form two Pareto curves, named RetinaNet-RS and Cascade RCNN-RS. These simple rescaled detectors explore the speed-accuracy trade-off between the one-stage RetinaNet detectors and two-stage RCNN detectors. Our largest Cascade RCNN-RS models achieve 52.9% AP with a ResNet152-FPN backbone and 53.6% with a SpineNet143L backbone. Finally, we show the ResNet architecture, with three minor architectural changes, outperforms EfficientNet as the backbone for object detection and instance segmentation systems.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAPL70.6Cascade RCNN-RS (SpineNet-143L, single scale)
Object DetectionCOCO minivalAPM56.7Cascade RCNN-RS (SpineNet-143L, single scale)
Object DetectionCOCO minivalAPS34.5Cascade RCNN-RS (SpineNet-143L, single scale)
Object DetectionCOCO minivalbox AP53.6Cascade RCNN-RS (SpineNet-143L, single scale)
Object DetectionCOCO minivalAPL70.3Cascade RCNN-RS (ResNet-200, single scale)
Object DetectionCOCO minivalAPM56.2Cascade RCNN-RS (ResNet-200, single scale)
Object DetectionCOCO minivalAPS33.9Cascade RCNN-RS (ResNet-200, single scale)
Object DetectionCOCO minivalbox AP53.1Cascade RCNN-RS (ResNet-200, single scale)
3DCOCO minivalAPL70.6Cascade RCNN-RS (SpineNet-143L, single scale)
3DCOCO minivalAPM56.7Cascade RCNN-RS (SpineNet-143L, single scale)
3DCOCO minivalAPS34.5Cascade RCNN-RS (SpineNet-143L, single scale)
3DCOCO minivalbox AP53.6Cascade RCNN-RS (SpineNet-143L, single scale)
3DCOCO minivalAPL70.3Cascade RCNN-RS (ResNet-200, single scale)
3DCOCO minivalAPM56.2Cascade RCNN-RS (ResNet-200, single scale)
3DCOCO minivalAPS33.9Cascade RCNN-RS (ResNet-200, single scale)
3DCOCO minivalbox AP53.1Cascade RCNN-RS (ResNet-200, single scale)
2D ClassificationCOCO minivalAPL70.6Cascade RCNN-RS (SpineNet-143L, single scale)
2D ClassificationCOCO minivalAPM56.7Cascade RCNN-RS (SpineNet-143L, single scale)
2D ClassificationCOCO minivalAPS34.5Cascade RCNN-RS (SpineNet-143L, single scale)
2D ClassificationCOCO minivalbox AP53.6Cascade RCNN-RS (SpineNet-143L, single scale)
2D ClassificationCOCO minivalAPL70.3Cascade RCNN-RS (ResNet-200, single scale)
2D ClassificationCOCO minivalAPM56.2Cascade RCNN-RS (ResNet-200, single scale)
2D ClassificationCOCO minivalAPS33.9Cascade RCNN-RS (ResNet-200, single scale)
2D ClassificationCOCO minivalbox AP53.1Cascade RCNN-RS (ResNet-200, single scale)
2D Object DetectionCOCO minivalAPL70.6Cascade RCNN-RS (SpineNet-143L, single scale)
2D Object DetectionCOCO minivalAPM56.7Cascade RCNN-RS (SpineNet-143L, single scale)
2D Object DetectionCOCO minivalAPS34.5Cascade RCNN-RS (SpineNet-143L, single scale)
2D Object DetectionCOCO minivalbox AP53.6Cascade RCNN-RS (SpineNet-143L, single scale)
2D Object DetectionCOCO minivalAPL70.3Cascade RCNN-RS (ResNet-200, single scale)
2D Object DetectionCOCO minivalAPM56.2Cascade RCNN-RS (ResNet-200, single scale)
2D Object DetectionCOCO minivalAPS33.9Cascade RCNN-RS (ResNet-200, single scale)
2D Object DetectionCOCO minivalbox AP53.1Cascade RCNN-RS (ResNet-200, single scale)
16kCOCO minivalAPL70.6Cascade RCNN-RS (SpineNet-143L, single scale)
16kCOCO minivalAPM56.7Cascade RCNN-RS (SpineNet-143L, single scale)
16kCOCO minivalAPS34.5Cascade RCNN-RS (SpineNet-143L, single scale)
16kCOCO minivalbox AP53.6Cascade RCNN-RS (SpineNet-143L, single scale)
16kCOCO minivalAPL70.3Cascade RCNN-RS (ResNet-200, single scale)
16kCOCO minivalAPM56.2Cascade RCNN-RS (ResNet-200, single scale)
16kCOCO minivalAPS33.9Cascade RCNN-RS (ResNet-200, single scale)
16kCOCO minivalbox AP53.1Cascade RCNN-RS (ResNet-200, single scale)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17