TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ResNet strikes back: An improved training procedure in timm

ResNet strikes back: An improved training procedure in timm

Ross Wightman, Hugo Touvron, Hervé Jégou

2021-10-01NeurIPS Workshop ImageNet_PPF 2021 12Image ClassificationData AugmentationDomain GeneralizationMedical Image ClassificationClassificationFine-Grained Image Classification
PaperPDFCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

The influential Residual Networks designed by He et al. remain the gold-standard architecture in numerous scientific publications. They typically serve as the default architecture in studies, or as baselines when new architectures are proposed. Yet there has been significant progress on best practices for training neural networks since the inception of the ResNet architecture in 2015. Novel optimization & data-augmentation have increased the effectiveness of the training recipes. In this paper, we re-evaluate the performance of the vanilla ResNet-50 when trained with a procedure that integrates such advances. We share competitive training settings and pre-trained models in the timm open-source library, with the hope that they will serve as better baselines for future work. For instance, with our more demanding training setting, a vanilla ResNet-50 reaches 80.4% top-1 accuracy at resolution 224x224 on ImageNet-val without extra data or distillation. We also report the performance achieved with popular models with our training procedure.

Results

TaskDatasetMetricValueModel
Domain AdaptationVizWiz-ClassificationAccuracy - All Images48.9ResNet-50 (gn)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images44.4ResNet-50 (gn)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images39.1ResNet-50 (gn)
Image ClassificationImageNet V2Top 1 Accuracy68.7ResNet50 (A1)
Image ClassificationCIFAR-10Percentage correct98.3ResNet50 (A1)
Image ClassificationCIFAR-10Percentage correct85.28cvpr_class
Image ClassificationFlowers-102Accuracy97.9ResNet50 (A1)
Image ClassificationFlowers-102FLOPS4.1ResNet50 (A1)
Image ClassificationiNaturalist 2019Top-1 Accuracy75ResNet50 (A2)
Image ClassificationCIFAR-100Percentage correct86.9ResNet50 (A1)
Image ClassificationOxford 102 FlowersFLOPS4.1ResNet50 (A1)
Fine-Grained Image ClassificationOxford 102 FlowersFLOPS4.1ResNet50 (A1)
ClassificationNCT-CRC-HE-100KAccuracy (%)95.46ResNeXt-50-32x4d
ClassificationNCT-CRC-HE-100KF1-Score97.46ResNeXt-50-32x4d
ClassificationNCT-CRC-HE-100KPrecision99.91ResNeXt-50-32x4d
ClassificationNCT-CRC-HE-100KSpecificity99.43ResNeXt-50-32x4d
Medical Image ClassificationNCT-CRC-HE-100KAccuracy (%)95.46ResNeXt-50-32x4d
Medical Image ClassificationNCT-CRC-HE-100KF1-Score97.46ResNeXt-50-32x4d
Medical Image ClassificationNCT-CRC-HE-100KPrecision99.91ResNeXt-50-32x4d
Medical Image ClassificationNCT-CRC-HE-100KSpecificity99.43ResNeXt-50-32x4d
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images48.9ResNet-50 (gn)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images44.4ResNet-50 (gn)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images39.1ResNet-50 (gn)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17