TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Classification/ObjectNet

Image Classification on ObjectNet

Metric: Top-1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top-1 Accuracy▼Extra DataPaperDate↕Code
1CoCa82.7YesCoCa: Contrastive Captioners are Image-Text Foun...2022-05-04Code
2LiT82.5YesLiT: Zero-Shot Transfer with Locked-image text T...2021-11-15Code
3BASIC82.3YesCombined Scaling for Zero-shot Transfer Learning2021-11-19-
4EVA-02-CLIP-E/14+79.6YesEVA-CLIP: Improved Training Techniques for CLIP ...2023-03-27Code
5Baseline (ViT-G/14)79.03YesModel soups: averaging weights of multiple fine-...2022-03-10Code
6Model soups (ViT-G/14)78.52YesModel soups: averaging weights of multiple fine-...2022-03-10Code
7MAWS (ViT-6.5B)77.9YesThe effectiveness of MAE pre-pretraining for bil...2023-03-23Code
8MAWS (ViT-2B)75.8YesThe effectiveness of MAE pre-pretraining for bil...2023-03-23Code
9MAWS (ViT-H)72.6YesThe effectiveness of MAE pre-pretraining for bil...2023-03-23Code
10CLIP72.3YesLearning Transferable Visual Models From Natural...2021-02-26Code
11ALIGN72.2YesCombined Scaling for Zero-shot Transfer Learning2021-11-19-
12WiSE-FT72.1YesRobust fine-tuning of zero-shot models2021-09-04Code
13ViT-e72NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
14ViT-G/1470.53YesScaling Vision Transformers2021-06-08Code
15SWAG (ViT H/14)69.5YesRevisiting Weakly Supervised Pre-Training of Vis...2022-01-20Code
16NS (Eff.-L2)68.5YesScaling Vision Transformers2021-06-08Code
17RegNetY 128GF (Platt)64.3YesRevisiting Weakly Supervised Pre-Training of Vis...2022-01-20Code
18LLE (ViT-H/14, MAE, Edge Aug)60.78NoA Whac-A-Mole Dilemma: Shortcuts Come in Multipl...2022-12-09Code
19SEER (RegNet10B)60.2YesVision Models Are More Robust And Fair When Pret...2022-02-16Code
20ViT H/14 (Platt)60YesRevisiting Weakly Supervised Pre-Training of Vis...2022-01-20Code
21BiT-L (ResNet-152x4)58.7YesBig Transfer (BiT): General Visual Representatio...2019-12-24Code
22ViT L/16 (Platt)57.3YesRevisiting Weakly Supervised Pre-Training of Vis...2022-01-20Code
23Vit B/16 (Bamboo)53.9YesBamboo: Building Mega-Scale Vision Dataset Conti...2022-03-15Code
24AR-L (Opt Relevance)52YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
25ALIGN-MRL51.6YesMatryoshka Representation Learning2022-05-26Code
26ViT-B/16 (ANN-1.3B)50.7YesBillion-Scale Pretraining with Vision Transforme...2021-08-12-
27ViT-B/16 (512x512) + Pyramid49.39YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
28ResNet-101 (JFT-300M)49.1YesBillion-Scale Pretraining with Vision Transforme...2021-08-12-
29ViT B/1648.9YesRevisiting Weakly Supervised Pre-Training of Vis...2022-01-20Code
30ViT-B/3248.4YesBillion-Scale Pretraining with Vision Transforme...2021-08-12-
31ViT-B/16 (512x512) + Pixel47.53YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
32AR-B (Opt Relevance)47.1YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
33BiT-M (ResNet-152x4)47YesBig Transfer (BiT): General Visual Representatio...2019-12-24Code
34ViT-B/16 (512x512)46.68YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
35ViT-B (Discrete 512x512)46.62YesDiscrete Representations Strengthen Vision Trans...2021-11-20Code
36AR-L46.5YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
37ViT-L (Opt Relevance)43.2YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
38CLIP L42.8YesOptimal Representations for Covariate Shift2021-12-31Code
39ResNet-50 (JFT-300M)42.5YesBillion-Scale Pretraining with Vision Transforme...2021-08-12-
40ViT-B (Opt Relevance)42.2YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
41CLIP L (LAION)42.1YesOptimal Representations for Covariate Shift2021-12-31Code
42AR-B41.4YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
43RegViT on 384x384 + Adv Pyramid39.79YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
44ResNet-152 + GenInt with Transfer39.38YesGenerative Interventions for Causal Learning2020-12-22Code
45AR-S (Opt Relevance)39.3YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
46ResNet-50 (Bamboo)38.8YesBamboo: Building Mega-Scale Vision Dataset Conti...2022-03-15Code
47RegViT on 384x384 + Adv Pixel37.41YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
48ViT-L37.4YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
49DeiT-L (Opt Relevance)36.3YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
50BiT-S (ResNet-152x4)36YesBig Transfer (BiT): General Visual Representatio...2019-12-24Code
51NASNet-A35.77Yes---
52PNASNet-5L35.63Yes---
53RegViT on 384x38435.59YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
54ViT-B35.1YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
55RegViT on 384x384 + Random Pyramid34.83YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
56AR-S34.3YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
57RegViT on 384x384 + Random Pixel34.12YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
58RegViT (RandAug) + Adv Pyramid32.92YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
59Inception-v432.24Yes---
60DeiT-S (Opt Relevance)31.6YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
61ResNet-50 + CGC31.53YesContext-Gated Convolution2019-10-12Code
62DeiT-L31.4YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
63Discrete ViT + Pixel30.98YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
64Discrete ViT + Pyramid30.28YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
65RegViT (RandAug) + Adv Pixel30.11YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
66Discrete ViT29.95YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
67ResNet-15229.59Yes---
68RegViT (RandAug) + Random Pyramid29.41YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
69RegViT (RandAug)29.3YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
70ResNet-50 + GroupNorm29.2YesImproving robustness against common corruptions ...2020-06-30Code
71ResNet-50 + RoHL29.2YesImproving robustness against common corruptions ...2020-06-30Code
72RegViT (RandAug) + Random Pixel28.72YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
73MLP-Mixer + Pyramid28.6YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
74ResNet-50 + FixUp28.5YesImproving robustness against common corruptions ...2020-06-30Code
75ResNet-50 + MixUp (rescaled)28.37YesOn Mixup Regularization2020-06-10Code
76DeiT-S28.3YesOptimizing Relevance Maps of Vision Transformers...2022-06-02Code
77ResNet-18 + GenInt with Transfer27.03YesGenerative Interventions for Causal Learning2020-12-22Code
78MLP-Mixer25.9YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
79RELICv225.9YesPushing the limits of self-supervised ResNets: C...2022-01-13Code
80ViT + MixUp25.65YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
81C-BYOL25.5YesCompressive Visual Representations2021-09-27Code
82MLP-Mixer + Pixel24.75YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
83BYOL (BG_RM)23.9YesCharacterizing and Improving the Robustness of S...2021-03-23-
84RELIC23.8YesPushing the limits of self-supervised ResNets: C...2022-01-13Code
85BYOL23YesPushing the limits of self-supervised ResNets: C...2022-01-13Code
86SwAV (BG_RM)21.9YesCharacterizing and Improving the Robustness of S...2021-03-23-
87ViT + CutMix21.61YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
88MoCo-v2 (BG_Swaps)20.8YesCharacterizing and Improving the Robustness of S...2021-03-23-
89C-SimCLR20.8YesCompressive Visual Representations2021-09-27Code
90SeLa(v2) (reverse linear probing)20.61Yes---
91DILEMMA20.51YesRepresentation Learning by Detecting Incorrect L...2022-04-10Code
92DeepCluster(v2) (reverse linear probing)19.73Yes---
93VGG-1419.13Yes---
94ResNet-50 (ImageNet-Captions)18.7YesData Determines Distributional Robustness in Con...2022-05-03Code
95SwAV (reverse linear probing)17.71Yes---
96ViT17.36YesPyramid Adversarial Training Improves ViT Perfor...2021-11-30Code
97ResNet34-RPG16.5YesCompact and Optimal Deep Learning with Recurrent...2021-07-15Code
98CLIP (CC12M pretrain)15.24YesRobust Cross-Modal Representation Learning with ...2022-04-10-
99SimCLR14.6YesPushing the limits of self-supervised ResNets: C...2022-01-13Code
100ResNet-152 (FRCNN-ag-ad, VOC)13.2YesClass-agnostic Object Detection2020-11-28-
101MoCo(v2) (reverse linear probing)12.67Yes---
102MoCHi (reverse linear probing)12.64Yes---
103OBoW (reverse linear probing)12.23Yes---
104AlexNet6.78Yes---
105BigBiGAN (RevNet-50 4×)4.92YesSelf-Supervised Learning for Large-Scale Unsuper...2020-08-24Code