TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Rethinking ImageNet Pre-training

Rethinking ImageNet Pre-training

Kaiming He, Ross Girshick, Piotr Dollár

2018-11-21ICCV 2019 10Semantic SegmentationInstance Segmentationobject-detectionObject Detection
PaperPDFCode

Abstract

We report competitive results on object detection and instance segmentation on the COCO dataset using standard models trained from random initialization. The results are no worse than their ImageNet pre-training counterparts even when using the hyper-parameters of the baseline system (Mask R-CNN) that were optimized for fine-tuning pre-trained models, with the sole exception of increasing the number of training iterations so the randomly initialized models may converge. Training from random initialization is surprisingly robust; our results hold even when: (i) using only 10% of the training data, (ii) for deeper and wider models, and (iii) for multiple tasks and metrics. Experiments show that ImageNet pre-training speeds up convergence early in training, but does not necessarily provide regularization or improve final target task accuracy. To push the envelope we demonstrate 50.9 AP on COCO object detection without using any external data---a result on par with the top COCO 2017 competition results that used ImageNet pre-training. These observations challenge the conventional wisdom of ImageNet pre-training for dependent tasks and we expect these discoveries will encourage people to rethink the current de facto paradigm of `pre-training and fine-tuning' in computer vision.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5066.8Mask R-CNN (ResNeXt-152-FPN, cascade)
Object DetectionCOCO minivalAP7552.9Mask R-CNN (ResNeXt-152-FPN, cascade)
Object DetectionCOCO minivalbox AP48.6Mask R-CNN (ResNeXt-152-FPN, cascade)
Object DetectionCOCO minivalbox AP47.4Mask R-CNN (ResNet-101-FPN, GN, Cascade)
Object DetectionCOCO minivalAP5067.1Mask R-CNN (ResNeXt-152-FPN)
Object DetectionCOCO minivalAP7551.1Mask R-CNN (ResNeXt-152-FPN)
Object DetectionCOCO minivalbox AP46.4Mask R-CNN (ResNeXt-152-FPN)
3DCOCO minivalAP5066.8Mask R-CNN (ResNeXt-152-FPN, cascade)
3DCOCO minivalAP7552.9Mask R-CNN (ResNeXt-152-FPN, cascade)
3DCOCO minivalbox AP48.6Mask R-CNN (ResNeXt-152-FPN, cascade)
3DCOCO minivalbox AP47.4Mask R-CNN (ResNet-101-FPN, GN, Cascade)
3DCOCO minivalAP5067.1Mask R-CNN (ResNeXt-152-FPN)
3DCOCO minivalAP7551.1Mask R-CNN (ResNeXt-152-FPN)
3DCOCO minivalbox AP46.4Mask R-CNN (ResNeXt-152-FPN)
2D ClassificationCOCO minivalAP5066.8Mask R-CNN (ResNeXt-152-FPN, cascade)
2D ClassificationCOCO minivalAP7552.9Mask R-CNN (ResNeXt-152-FPN, cascade)
2D ClassificationCOCO minivalbox AP48.6Mask R-CNN (ResNeXt-152-FPN, cascade)
2D ClassificationCOCO minivalbox AP47.4Mask R-CNN (ResNet-101-FPN, GN, Cascade)
2D ClassificationCOCO minivalAP5067.1Mask R-CNN (ResNeXt-152-FPN)
2D ClassificationCOCO minivalAP7551.1Mask R-CNN (ResNeXt-152-FPN)
2D ClassificationCOCO minivalbox AP46.4Mask R-CNN (ResNeXt-152-FPN)
2D Object DetectionCOCO minivalAP5066.8Mask R-CNN (ResNeXt-152-FPN, cascade)
2D Object DetectionCOCO minivalAP7552.9Mask R-CNN (ResNeXt-152-FPN, cascade)
2D Object DetectionCOCO minivalbox AP48.6Mask R-CNN (ResNeXt-152-FPN, cascade)
2D Object DetectionCOCO minivalbox AP47.4Mask R-CNN (ResNet-101-FPN, GN, Cascade)
2D Object DetectionCOCO minivalAP5067.1Mask R-CNN (ResNeXt-152-FPN)
2D Object DetectionCOCO minivalAP7551.1Mask R-CNN (ResNeXt-152-FPN)
2D Object DetectionCOCO minivalbox AP46.4Mask R-CNN (ResNeXt-152-FPN)
16kCOCO minivalAP5066.8Mask R-CNN (ResNeXt-152-FPN, cascade)
16kCOCO minivalAP7552.9Mask R-CNN (ResNeXt-152-FPN, cascade)
16kCOCO minivalbox AP48.6Mask R-CNN (ResNeXt-152-FPN, cascade)
16kCOCO minivalbox AP47.4Mask R-CNN (ResNet-101-FPN, GN, Cascade)
16kCOCO minivalAP5067.1Mask R-CNN (ResNeXt-152-FPN)
16kCOCO minivalAP7551.1Mask R-CNN (ResNeXt-152-FPN)
16kCOCO minivalbox AP46.4Mask R-CNN (ResNeXt-152-FPN)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17