TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Classification/ImageNet-1K (with DeiT-S)

Image Classification on ImageNet-1K (with DeiT-S)

Metric: Top 1 Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Top 1 Accuracy▼Extra DataPaperDate↕Code
1MCTF ($r=16$)80.1NoMulti-criteria Token Fusion with One-step-ahead ...2024-03-15Code
2dTPS80.1NoJoint Token Pruning and Squeezing Towards More A...2023-04-21Code
3MCTF ($r=18$)79.9NoMulti-criteria Token Fusion with One-step-ahead ...2024-03-15Code
4DiffRate79.8NoDiffRate : Differentiable Compression Rate for E...2023-05-29Code
5PPT79.8NoPPT: Token Pruning and Pooling for Efficient Vis...2023-10-03Code
6DynamicViT (80%)79.8NoDynamicViT: Efficient Vision Transformers with D...2021-06-03Code
7EViT (80%)79.8NoNot All Patches are What You Need: Expediting Vi...2022-02-16Code
8LTMP (80%)79.8NoLearned Thresholds Token Merging and Pruning for...2023-07-20Code
9SPViT (3.9G)79.8NoSPViT: Enabling Faster Vision Transformers via S...2021-12-27Code
10EViT (90%)79.8NoNot All Patches are What You Need: Expediting Vi...2022-02-16Code
11DynamicViT (90%)79.8NoDynamicViT: Efficient Vision Transformers with D...2021-06-03Code
12Base (DeiT-S)79.8NoTraining data-efficient image transformers & dis...2020-12-23Code
13ATS79.7NoAdaptive Token Sampling For Efficient Vision Tra...2021-11-30Code
14eTPS79.7NoJoint Token Pruning and Squeezing Towards More A...2023-04-21Code
15ToMe ($r=8$)79.7NoToken Merging: Your ViT But Faster2022-10-17Code
16BAT (70%)79.6NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code
17AS-DeiT-S (65%)79.6NoAdaptive Sparse ViT: Towards Learnable Adaptive ...2022-09-28Code
18LTMP (60%)79.6NoLearned Thresholds Token Merging and Pruning for...2023-07-20Code
19MCTF ($r=20$)79.5NoMulti-criteria Token Fusion with One-step-ahead ...2024-03-15Code
20DPS-ViT79.5NoPatch Slimming for Efficient Vision Transformers2021-06-05-
21EViT (70%)79.5NoNot All Patches are What You Need: Expediting Vi...2022-02-16Code
22PS-ViT79.4NoPatch Slimming for Efficient Vision Transformers2021-06-05-
23ToMe ($r=13$)79.4NoToken Merging: Your ViT But Faster2022-10-17Code
24EvoViT79.4NoEvo-ViT: Slow-Fast Token Evolution for Dynamic V...2021-08-03Code
25SPViT (2.6G)79.3NoSPViT: Enabling Faster Vision Transformers via S...2021-12-27Code
26BAT (60%)79.3NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code
27DynamicViT (70%)79.3NoDynamicViT: Efficient Vision Transformers with D...2021-06-03Code
28S$^2$ViTE79.2NoChasing Sparsity in Vision Transformers: An End-...2021-06-08Code
29ToMe ($r=16$)79.1NoToken Merging: Your ViT But Faster2022-10-17Code
30IA-RED$^2$79.1NoIA-RED$^2$: Interpretability-Aware Redundancy Re...2021-06-23-
31BAT (50%)79NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code
32EViT (60%)78.9NoNot All Patches are What You Need: Expediting Vi...2022-02-16Code
33AS-DeiT-S (50%)78.7NoAdaptive Sparse ViT: Towards Learnable Adaptive ...2022-09-28Code
34BAT (40%)78.6NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code
35LTMP (45%)78.6NoLearned Thresholds Token Merging and Pruning for...2023-07-20Code
36A-ViT78.6NoAdaViT: Adaptive Tokens for Efficient Vision Tra...2021-12-14Code
37EViT (50%)78.5NoNot All Patches are What You Need: Expediting Vi...2022-02-16Code
38HVT-S-178.3NoScalable Vision Transformers with Hierarchical P...2021-03-19Code
39SPViT78.3NoPruning Self-attentions into Convolutional Layer...2021-11-23Code
40BAT (30%)77.8NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code
41BAT (20%)76.4NoBeyond Attentive Tokens: Incorporating Token Imp...2022-11-21Code