TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Classification/CIFAR-100

Image Classification on CIFAR-100

Metric: Percentage correct (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Percentage correct▼Extra DataPaperDate↕Code
1EffNet-L2 (SAM)96.08YesSharpness-Aware Minimization for Efficiently Imp...2020-10-03Code
2Swin-L + ML-Decoder95.1YesML-Decoder: Scalable and Versatile Classificatio...2021-11-25Code
3µ2Net (ViT-L/16)94.95YesAn Evolutionary Approach to Dynamic Introduction...2022-05-25Code
4ViT-B-16 (ImageNet-21K-P pretrain)94.2YesImageNet-21K Pretraining for the Masses2021-04-22Code
5CvT-W2494.09YesCvT: Introducing Convolutions to Vision Transfor...2021-03-29Code
6ViT-B/16 (PUGD)93.95YesPerturbated Gradients Updating within Unit Space...2021-10-01Code
7Heinsen Routing + BEiT-large 16 22493.8YesAn Algorithm for Routing Vectors in Sequences2022-11-20Code
8BiT-L (ResNet)93.51YesBig Transfer (BiT): General Visual Representatio...2019-12-24Code
9VIT-L/16 (Spinal FC, Background)93.31NoReduction of Class Activation Uncertainty with B...2023-05-05Code
10CaiT-M-36 U 22493.1YesGoing deeper with Image Transformers2021-03-31Code
11ViT-L (attn fine-tune)93YesThree things everyone should know about Vision T...2022-03-18Code
12TResNet-L-V292.6YesTResNet: High Performance GPU-Dedicated Architec...2020-03-30Code
13EfficientNetV2-L92.3YesEfficientNetV2: Smaller Models and Faster Training2021-04-01Code
14EfficientNetV2-M92.2YesEfficientNetV2: Smaller Models and Faster Training2021-04-01Code
15BiT-M (ResNet)92.17YesBig Transfer (BiT): General Visual Representatio...2019-12-24Code
16CeiT-S91.8YesIncorporating Convolution Designs into Visual Tr...2021-03-22Code
17CeiT-S (384 finetune resolution)91.8YesIncorporating Convolution Designs into Visual Tr...2021-03-22Code
18EfficientNet-B791.7YesEfficientNet: Rethinking Model Scaling for Convo...2019-05-28Code
19EfficientNetV2-S91.5YesEfficientNetV2: Smaller Models and Faster Training2021-04-01Code
20GPIPE91.3YesGPipe: Efficient Training of Giant Neural Networ...2018-11-16Code
21TNT-B91.1YesTransformer in Transformer2021-02-27Code
22DeiT-B90.8YesTraining data-efficient image transformers & dis...2020-12-23Code
23GFNet-H-B90.3YesGlobal Filter Networks for Image Classification2021-07-01Code
24E2E-3M90.27YesRethinking Recurrent Neural Networks and Other I...2020-07-30Code
25Bamboo (ViT-B/16)90.2YesBamboo: Building Mega-Scale Vision Dataset Conti...2022-03-15Code
26PyramidNet-272 (ASAM)89.9NoASAM: Adaptive Sharpness-Aware Minimization for ...2021-02-23Code
27PyramidNet (SAM)89.7NoSharpness-Aware Minimization for Efficiently Imp...2020-10-03Code
28DVT (T2T-ViT-24)89.63YesNot All Images are Worth 16x16 Words: Dynamic Tr...2021-05-31Code
29ResMLP-2489.5YesResMLP: Feedforward networks for image classific...2021-05-07Code
30PyramidNet-272, S=489.46YesTowards Better Accuracy-efficiency Trade-offs: D...2020-11-30Code
31CeiT-T89.4YesIncorporating Convolution Designs into Visual Tr...2021-03-22Code
32PyramidNet+ShakeDrop89.3YesAutoAugment: Learning Augmentation Policies from...2018-05-24Code
33ViT-B/16- SAM89.1YesWhen Vision Transformers Outperform ResNets with...2021-06-03Code
34ConvMLP-M89.1YesConvMLP: Hierarchical Convolutional MLPs for Vis...2021-09-09Code
35ConvMLP-L88.6YesConvMLP: Hierarchical Convolutional MLPs for Vis...2021-09-09Code
36ResNet-152x4-AGC (ImageNet-21K)88.54YesEffect of Pre-Training Scale on Intra- and Inter...2021-05-31Code
37ColorNet88.4YesColorNet: Investigating the importance of color ...2019-02-01Code
38PyramidNet+ShakeDrop (Fast AA)88.3YesFast AutoAugment2019-05-01Code
39NAT-M488.3YesNeural Architecture Transfer2020-05-12Code
40CeiT-T (384 finetune resolution)88YesIncorporating Convolution Designs into Visual Tr...2021-03-22Code
41NAT-M387.7YesNeural Architecture Transfer2020-05-12Code
42ViT-S/16- SAM87.6YesWhen Vision Transformers Outperform ResNets with...2021-06-03Code
43NAT-M287.5YesNeural Architecture Transfer2020-05-12Code
44Dynamics 187.48YesPSO-Convolutional Neural Networks with Heterogen...2022-05-20Code
45DenseNet-BC-190, S=487.44YesTowards Better Accuracy-efficiency Trade-offs: D...2020-11-30Code
46ConvMLP-S87.4YesConvMLP: Hierarchical Convolutional MLPs for Vis...2021-09-09Code
47ResMLP-1287YesResMLP: Feedforward networks for image classific...2021-05-07Code
48WRN-40-10, S=486.9YesTowards Better Accuracy-efficiency Trade-offs: D...2020-11-30Code
49ResNet50 (A1)86.9YesResNet strikes back: An improved training proced...2021-10-01Code
50WRN-28-10 * 386.81YesMixMo: Mixing Multiple Inputs for Multiple Outpu...2021-03-10Code
51PyramidNet + AA (AMP)86.64YesRegularizing Neural Networks via Adversarial Mod...2020-10-10Code
52PyramidNet-200 + Shakedrop + Cutmix + PS-KD86.41YesSelf-Knowledge Distillation with Progressive Ref...2020-06-22Code
53Mixer-B/16- SAM86.4YesWhen Vision Transformers Outperform ResNets with...2021-06-03Code
54ResCNet-5086.31NoDeep Feature Response Discriminative Calibration2024-11-16Code
55PyramidNet-200 + Shakedrop + Cutmix86.19YesCutMix: Regularization Strategy to Train Strong ...2019-05-13Code
56MUXNet-m86.1YesMUXConv: Information Multiplexing in Convolution...2020-03-31Code
57NAT-M186YesNeural Architecture Transfer2020-05-12Code
58WRN-28-1085.77YesMixMo: Mixing Multiple Inputs for Multiple Outpu...2021-03-10Code
59WRN-28-10, S=485.74YesTowards Better Accuracy-efficiency Trade-offs: D...2020-11-30Code
60WRN-28-8 (SAMix+DM)85.59No---
61WRN-28-8 +SAMix85.5YesBoosting Discriminative Visual Representation Le...2021-11-30Code
62ASANas85.42YesImproving Neural Architecture Search Image Class...2019-03-14Code
63WRN-28-8 (AutoMix+DM)85.38No---
64SparseSwin85.35YesSparseSwin: Swin Transformer with Sparse Transfo...2023-09-11Code
65WRN-28-8 (PuzzleMix+DM)85.25No---
66ResNet-50-SAM85.2YesWhen Vision Transformers Outperform ResNets with...2021-06-03Code
67WRN-28-8 +AutoMix85.16YesAutoMix: Unveiling the Power of Mixup for Strong...2021-03-24Code
68WaveMixLite-256/785.09YesWaveMix: A Resource-efficient Neural Network for...2022-05-28Code
69MANO-tiny85.08YesLinear Attention with Global Context: A Multipol...2025-07-03Code
70WRN 28-1485YesNeural networks with late-phase weights2020-07-25Code
71R-Mix (WideResNet 28-10)85YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
72EEEA-Net-C (b=5)+ CO84.98YesEEEA-Net: An Early Exit Evolutionary Neural Arch...2021-08-13Code
73RL-Mix (WideResNet 28-10)84.9YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
74Wide-ResNet-28-1084.89YesAutomatic Data Augmentation via Invariance-Const...2022-09-29Code
75SENet + ShakeEven + Cutout84.59YesSqueeze-and-Excitation Networks2017-09-05Code
76ResNeXt-50(32x4d) + SAMix84.42YesBoosting Discriminative Visual Representation Le...2021-11-30Code
77WRN-28-10 with reSGHMC84.38YesNon-convex Learning via Replica Exchange Stochas...2020-08-12Code
78PyramidNet-272 + SWA84.16YesAveraging Weights Leads to Wider Optima and Bett...2018-03-14Code
79WRN28-1084.05YesPuzzle Mix: Exploiting Saliency and Local Statis...2020-09-15Code
80HCGNet-A384.04YesGated Convolutional Networks with Hybrid Connect...2019-08-26Code
81WideResNet 28-10 + CutMix (OneCycleLR scheduler)83.97YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
82DenseNet-BC-190 + FMix83.95YesFMix: Enhancing Mixed Sample Data Augmentation2020-02-27Code
83ORN83.85YesOriented Response Networks2017-01-07Code
84Grafit (ResNet-50)83.7YesGrafit: Learning fine-grained image representati...2020-11-25-
85ResNeXt-50(32x4d) + AutoMix83.64YesAutoMix: Unveiling the Power of Mixup for Strong...2021-03-24Code
86CCT-7/3x1+HTM+VTM83.57YesTokenMixup: Efficient Attention-guided Token-lev...2022-10-14Code
87HCGNet-A283.46YesGated Convolutional Networks with Hybrid Connect...2019-08-26Code
88Res2NeXt-2983.44YesRes2Net: A New Multi-scale Backbone Architecture2019-04-02Code
89DenseNet-BC-190 + Mixup83.2Yesmixup: Beyond Empirical Risk Minimization2017-10-25Code
90SSAL-DenseNet 190-4083.2YesContextual Classification Using Self-Supervised ...2021-01-07Code
91EnAET83.13YesEnAET: A Self-Trained framework for Semi-Supervi...2019-11-21Code
92WRN 28-1083.06YesNeural networks with late-phase weights2020-07-25Code
93R-Mix (ResNeXt 29-4-24)83.02YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
94Wide ResNet+Cutout+no BN scale/offset learning82.95YesSingle-bit-per-weight deep convolutional neural ...2019-07-16Code
95WRN-16-8 with reSGHMC82.95YesNon-convex Learning via Replica Exchange Stochas...2020-08-12Code
96DenseNet-BC82.82YesDensely Connected Convolutional Networks2016-08-25Code
97ABNet-2G-R3-Combined82.784NoANDHRA Bandersnatch: Training Neural Networks to...2024-11-28Code
98CCT-7/3x1*82.72YesEscaping the Big Data Paradigm with Compact Tran...2021-04-12Code
99EXACT (WRN-28-10)82.68NoEXACT: How to Train Your Accuracy2022-05-19Code
100SKNet-29 (ResNeXt-29, 16×32d)82.67YesSelective Kernel Networks2019-03-15Code
101DenseNet82.62YesDensely Connected Convolutional Networks2016-08-25Code
102Shared WRN82.57YesLearning Implicitly Recurrent CNNs Through Param...2019-02-26Code
103Transformer local-attention (NesT-B)82.56YesNested Hierarchical Transformer: Towards Accurat...2021-05-26Code
104RL-Mix (ResNeXt 29-4-24)82.43YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
105Mixer-S/16- SAM82.4YesWhen Vision Transformers Outperform ResNets with...2021-06-03Code
106R-Mix (WideResNet 16-8)82.32YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
107ResNeXt 29-4-24 + CutMix (OneCycleLR scheduler)82.3YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
108WARN82.18YesAttend and Rectify: a Gated Attention Mechanism ...2018-07-19Code
109RL-Mix (WideResNet 16-8)82.16YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
110WRN+SWA82.15YesAveraging Weights Leads to Wider Optima and Bett...2018-03-14Code
111Manifold Mixup81.96YesManifold Mixup: Better Representations by Interp...2018-06-13Code
112HCGNet-A181.87YesGated Convolutional Networks with Hybrid Connect...2019-08-26Code
113WideResNet 16-8 + CutMix (OneCycleLR scheduler)81.79YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
114Residual Gates + WRN81.73YesLearning Identity Mappings with Residual Gates2016-11-04-
115kNN-CLIP81.7YesRevisiting a kNN-based Image Classification Syst...2022-04-03-
116AA-Wide-ResNet81.6YesAttention Augmented Convolutional Networks2019-04-22Code
117PDO-eConv (p8, 4.6M)81.6YesPDO-eConvs: Partial Differential Operator Based ...2020-07-20Code
118SEER (RegNet10B)81.53YesVision Models Are More Robust And Fair When Pret...2022-02-16Code
119R-Mix (PreActResNet-18)81.49YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
120ResNet50 (FSGDM)81.44NoOn the Performance Analysis of Momentum Method: ...2024-11-29Code
121Wide-ResNet-40-281.19YesAutomatic Data Augmentation via Invariance-Const...2022-09-29Code
122Wide ResNet81.15YesWide Residual Networks2016-05-23Code
123CoPaNet-R-16481.1YesDeep Competitive Pathway Networks2017-09-29Code
124ABNet-2G-R380.83NoANDHRA Bandersnatch: Training Neural Networks to...2024-11-28Code
125RL-Mix (PreActResNet-18)80.75YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
126PreActResNet-18 + CutMix (OneCycleLR scheduler)80.6YesExpeditious Saliency-guided Mix-up through Rando...2022-12-09Code
127GAC-SNN80.45NoGated Attention Coding for Training High-perform...2023-08-12Code
128ABNet-2G-R280.354NoANDHRA Bandersnatch: Training Neural Networks to...2024-11-28Code
129SimpleNetv280.29YesTowards Principled Design of Deep Convolutional ...2018-02-17Code
130UPANets80.29YesUPANets: Learning from the Universal Pixel Atten...2021-03-15Code
131PreActResNet-18 + SageMix80.16YesSageMix: Saliency-Guided Mixup for Point Clouds2022-10-13Code
132ResNet56 with reSGHMC80.14YesNon-convex Learning via Replica Exchange Stochas...2020-08-12Code
133PDO-eConv (p8, 2.62M)79.99YesPDO-eConvs: Partial Differential Operator Based ...2020-07-20Code
134VGG11B(3x) + LocalLearning79.9YesTraining Neural Networks with Local Error Signals2019-01-20Code
135NNCLR79YesWith a Little Help from My Friends: Nearest-Neig...2021-04-29Code
136ABNet-2G-R178.792NoANDHRA Bandersnatch: Training Neural Networks to...2024-11-28Code
137PreActResNet18 (AMP)78.49YesRegularizing Neural Networks via Adversarial Mod...2020-10-10Code
138SimpleNetv178.37YesLets keep it simple, Using simple architectures ...2016-08-22Code
139ViT (lightweight, MAE pre-trained)78.27NoPre-training of Lightweight Vision Transformers ...2024-02-06-
140PDC77.9YesAugmenting Deep Classifiers with Polynomial Neur...2021-04-16Code
141MobileNetV3-large x1.0 (BSConv-U)77.7YesRethinking Depthwise Separable Convolutions: How...2020-03-30Code
142CCT-6/3x177.31YesEscaping the Big Data Paradigm with Compact Tran...2021-04-12Code
143ResNet-100177.3YesIdentity Mappings in Deep Residual Networks2016-03-16Code
144Evolution77YesLarge-Scale Evolution of Image Classifiers2017-03-03Code
145DIANet76.98YesDIANet: Dense-and-Implicit Attention Network2019-05-25Code
146LP-BNN (ours) + cutout76.85YesEncoding the latent posterior of Bayesian Neural...2020-12-04Code
147ResNet-18+MM+FRL76.64YesLearning Class Unique Features in Fine-Grained V...2020-11-22-
148ResNet32 with reSGHMC76.55YesNon-convex Learning via Replica Exchange Stochas...2020-08-12Code
149MomentumNet76.38YesMomentum Residual Neural Networks2021-02-15Code
150SSCNN75.7YesSpatially-sparse convolutional neural networks2014-09-22Code
151Exponential Linear Units75.7YesFast and Accurate Deep Network Learning by Expon...2015-11-23Code
152ResNet-975.59YesCNN Filter DB: An Empirical Investigation of Tra...2022-03-29Code
153Stochastic Depth75.42YesDeep Networks with Stochastic Depth2016-03-30Code
154ResNet v2-110 (Mish activation)74.41YesMish: A Self Regularized Non-Monotonic Activatio...2019-08-23Code
155Dspike (ResNet-18)74.24No---
156ResNet20 with reSGHMC74.14YesNon-convex Learning via Replica Exchange Stochas...2020-08-12Code
157MixMatch74.1YesMixMatch: A Holistic Approach to Semi-Supervised...2019-05-06Code
158Beta-Rank74.01NoBeta-Rank: A Robust Convolutional Filter Pruning...2023-04-15Code
159PreResNet-11073.98NoHow to Use Dropout Correctly on Residual Network...2023-02-13Code
160ABNet-2G-R073.93NoANDHRA Bandersnatch: Training Neural Networks to...2024-11-28Code
161Fractional MP73.6YesFractional Max-Pooling2014-12-18Code
162ResNet+ELU73.5NoDeep Residual Networks with Exponential Linear U...2016-04-14Code
163PDO-eConv (p6m,0.37M)73YesPDO-eConvs: Partial Differential Operator Based ...2020-07-20Code
164SOPCNN72.96YesStochastic Optimization of Plain Convolutional N...2020-01-24Code
165PDO-eConv (p6,0.36M)72.87YesPDO-eConvs: Partial Differential Operator Based ...2020-07-20Code
166Tuned CNN72.6YesScalable Bayesian Optimization Using Deep Neural...2015-02-19Code
167ResNet-110 (SAP)72.537NoStochastic Subsampling With Average Pooling2024-09-25-
168CMsC72.4YesCompetitive Multi-scale Convolution2015-11-18-
169Fitnet4-LSUV72.3YesAll you need is a good init2015-11-19Code
170GAN+ResNet71.52No--Code
171kMobileNet V3 Large 16ch71.36Yes--Code
172BNM NiN71.1YesBatch-normalized Maxout Network in Network2015-11-09Code
173OTTT71.05NoOnline Training Through Time for Spiking Neural ...2022-10-09Code
174MIM70.8YesOn the Importance of Normalisation Layers in Dee...2015-08-03-
175WaveMix-Lite-256/770.2NoWaveMix: A Resource-efficient Neural Network for...2022-05-28Code
176IM-Loss (VGG-16)70.18No---
177NiN+APL69.2YesLearning Activation Functions to Improve Deep Ne...2014-12-21Code
178SWWAE69.1YesStacked What-Where Auto-encoders2015-06-08Code
179NiN+Superclass+CDJ69YesDeep Convolutional Decision Jungle for Image Cla...2017-06-06-
180Spectral Representations for Convolutional Neural Networks68.4NoSpectral Representations for Convolutional Neura...2015-06-11-
181ReActNet-1868.34No"BNN - BN = ?": Training Binary Neural Networks ...2021-04-16Code
182VDN67.8NoTraining Very Deep Networks2015-07-22Code
183DCNN+GFE67.7NoDeep Convolutional Neural Networks as Generic Fe...2017-10-06-
184Tree+Max-Avg pooling67.6NoGeneralizing Pooling Functions in Convolutional ...2015-09-30Code
185HD-CNN67.4NoHD-CNN: Hierarchical Deep Convolutional Neural N...2014-10-03Code
186Universum Prescription67.2NoUniversum Prescription: Regularization using Unl...2015-11-11-
187ResNet50 Without Transfer Learning67.06No--Code
188AlexNet (KP)66.78No---
189ACN66.3NoStriving for Simplicity: The All Convolutional Net2014-12-21Code
190DLME (ResNet-18, linear)66.1NoDLME: Deep Local-flatness Manifold Embedding2022-07-07Code
191ResNet-18 (modified)66NoFatNet: High Resolution Kernels for Classificati...2022-10-30Code
192DSN65.4NoDeeply-Supervised Nets2014-09-18Code
193NiN64.3NoNetwork In Network2013-12-16Code
194Tree Priors63.2No---
195DNN+Probabilistic Maxout61.9NoImproving Deep Neural Networks with Probabilisti...2013-12-20-
196Maxout Network (k=2)61.43NoMaxout Networks2013-02-18Code
197ResNet20+UnsharpMaskLayer60.36No--Code
198Convolutional Linear Transformer for Vision (CLTV)60.11NoConvolutional Xformers for Vision2022-01-25Code
199FatNet of ResNet-1860NoFatNet: High Resolution Kernels for Classificati...2022-10-30Code
200Optical Simulation of FatNet60NoFatNet: High Resolution Kernels for Classificati...2022-10-30Code
201RReLU59.8NoEmpirical Evaluation of Rectified Activations in...2015-05-05Code
202Stochastic Pooling57.5NoStochastic Pooling for Regularization of Deep Co...2013-01-16Code
203Sign-symmetry48.75NoHow Important is Weight Symmetry in Backpropagat...2015-10-17Code
204AlexNet (DFA)48.03No---
205CNN3942.64NoSharpness-Aware Minimization for Efficiently Imp...2020-10-03Code
206CNN3636.07NoSharpness-Aware Minimization for Efficiently Imp...2020-10-03Code
207CNN3735.05NoSharpness-aware Quantization for Deep Neural Net...2021-11-24Code
208AlexNet (FA)19.49No---