TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ResMLP: Feedforward networks for image classification with...

ResMLP: Feedforward networks for image classification with data-efficient training

Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Hervé Jégou

2021-05-07NeurIPS 2021 12Machine TranslationSelf-Supervised Image ClassificationImage ClassificationData AugmentationTranslationGeneral ClassificationFine-Grained Image Classification
PaperPDFCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.

Results

TaskDatasetMetricValueModel
Machine TranslationWMT2014 English-GermanBLEU score26.8ResMLP-12
Machine TranslationWMT2014 English-GermanBLEU score26.4ResMLP-6
Machine TranslationWMT2014 English-FrenchBLEU score40.6ResMLP-12
Machine TranslationWMT2014 English-FrenchBLEU score40.3ResMLP-6
Image ClassificationStanford CarsAccuracy89.5ResMLP-24
Image ClassificationStanford CarsAccuracy84.6ResMLP-12
Image ClassificationImageNet V2Top 1 Accuracy74.2ResMLP-B24/8 22k
Image ClassificationImageNet V2Top 1 Accuracy73.4ResMLP-B24/8
Image ClassificationImageNet V2Top 1 Accuracy69.8ResMLP-S24/16
Image ClassificationImageNet V2Top 1 Accuracy66ResMLP-S12/16
Image ClassificationiNaturalist 2018Top-1 Accuracy64.3ResMLP-24
Image ClassificationiNaturalist 2018Top-1 Accuracy60.2ResMLP-12
Image ClassificationFlowers-102Accuracy97.9ResMLP24
Image ClassificationFlowers-102Accuracy97.4ResMLP12
Image ClassificationCertificate VerificationPercentage correct98.7ResMLP-24
Image ClassificationCertificate VerificationTop-1 Accuracy98.7ResMLP-24
Image ClassificationCertificate VerificationPercentage correct98.1ResMLP-12
Image ClassificationCertificate VerificationTop-1 Accuracy98.1ResMLP-12
Image ClassificationiNaturalist 2019Top-1 Accuracy72.5ResMLP-24
Image ClassificationiNaturalist 2019Top-1 Accuracy71ResMLP-12
Image ClassificationCIFAR-100Percentage correct89.5ResMLP-24
Image ClassificationCIFAR-100Percentage correct87ResMLP-12
Image ClassificationImageNetGFLOPs6ResMLP-S24
Image ClassificationImageNetGFLOPs3ResMLP-12 (distilled, class-MLP)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17