Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Hervé Jégou
We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Machine Translation | WMT2014 English-German | BLEU score | 26.8 | ResMLP-12 |
| Machine Translation | WMT2014 English-German | BLEU score | 26.4 | ResMLP-6 |
| Machine Translation | WMT2014 English-French | BLEU score | 40.6 | ResMLP-12 |
| Machine Translation | WMT2014 English-French | BLEU score | 40.3 | ResMLP-6 |
| Image Classification | Stanford Cars | Accuracy | 89.5 | ResMLP-24 |
| Image Classification | Stanford Cars | Accuracy | 84.6 | ResMLP-12 |
| Image Classification | ImageNet V2 | Top 1 Accuracy | 74.2 | ResMLP-B24/8 22k |
| Image Classification | ImageNet V2 | Top 1 Accuracy | 73.4 | ResMLP-B24/8 |
| Image Classification | ImageNet V2 | Top 1 Accuracy | 69.8 | ResMLP-S24/16 |
| Image Classification | ImageNet V2 | Top 1 Accuracy | 66 | ResMLP-S12/16 |
| Image Classification | iNaturalist 2018 | Top-1 Accuracy | 64.3 | ResMLP-24 |
| Image Classification | iNaturalist 2018 | Top-1 Accuracy | 60.2 | ResMLP-12 |
| Image Classification | Flowers-102 | Accuracy | 97.9 | ResMLP24 |
| Image Classification | Flowers-102 | Accuracy | 97.4 | ResMLP12 |
| Image Classification | Certificate Verification | Percentage correct | 98.7 | ResMLP-24 |
| Image Classification | Certificate Verification | Top-1 Accuracy | 98.7 | ResMLP-24 |
| Image Classification | Certificate Verification | Percentage correct | 98.1 | ResMLP-12 |
| Image Classification | Certificate Verification | Top-1 Accuracy | 98.1 | ResMLP-12 |
| Image Classification | iNaturalist 2019 | Top-1 Accuracy | 72.5 | ResMLP-24 |
| Image Classification | iNaturalist 2019 | Top-1 Accuracy | 71 | ResMLP-12 |
| Image Classification | CIFAR-100 | Percentage correct | 89.5 | ResMLP-24 |
| Image Classification | CIFAR-100 | Percentage correct | 87 | ResMLP-12 |
| Image Classification | ImageNet | GFLOPs | 6 | ResMLP-S24 |
| Image Classification | ImageNet | GFLOPs | 3 | ResMLP-12 (distilled, class-MLP) |