TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Zero-Shot Transfer Image Classification/ImageNet

Zero-Shot Transfer Image Classification on ImageNet

Metric: Accuracy (Private) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy (Private)▼Extra DataPaperDate↕Code
1M2-Encoder88.5YesM2-Encoder: Advancing Bilingual Image-Text Under...2024-01-29Code
2BASIC (Lion)88.3No---
3CoCa86.3YesCoCa: Contrastive Captioners are Image-Text Foun...2022-05-04Code
4LiT-22B85.9NoScaling Vision Transformers to 22 Billion Parame...2023-02-10Code
5BASIC85.7YesCombined Scaling for Zero-shot Transfer Learning2021-11-19-
6LiT ViT-e85.4NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
7LiT-tuning84.5NoLiT: Zero-Shot Transfer with Locked-image text T...2021-11-15Code
8IMP-MoE-L83.9NoAlternating Gradient Descent and Mixture-of-Expe...2023-05-10-
9EVA-CLIP-18B83.8NoEVA-CLIP-18B: Scaling CLIP to 18 Billion Paramet...2024-02-06Code
10InternVL-C83.2NoInternVL: Scaling up Vision Foundation Models an...2023-12-21Code
11MAWS (ViT-2B)82.1NoThe effectiveness of MAE pre-pretraining for bil...2023-03-23Code
12EVA-CLIP-E/14+82NoEVA-CLIP: Improved Training Techniques for CLIP ...2023-03-27Code
13CLIPA (ViT-H/14-336px)81.8No---
14MAWS (ViT-H)81.1NoThe effectiveness of MAE pre-pretraining for bil...2023-03-23Code
15REACT78.5NoLearning Customized Visual Models with Retrieval...2023-01-17Code
16ALIGN76.4NoScaling Up Visual and Vision-Language Representa...2021-02-11Code
17CLIP(ViT-L/14-336px)76.2YesLearning Transferable Visual Models From Natural...2021-02-26Code
18AltCLIP74.5NoAltCLIP: Altering the Language Encoder in CLIP f...2022-11-12Code
19PaLI72.11YesPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
20Diffusion Classifier (zero-shot)61.4NoYour Diffusion Model is Secretly a Zero-Shot Cla...2023-03-28Code
21CLIP (ResNet50)59.6YesLearning Transferable Visual Models From Natural...2021-02-26Code