Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Classification on ImageNet-1K (with DeiT-S)

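Metric: GFLOPs (the leaderboard marks higher as better and sorts in descending order; as a compute-cost measure, a lower GFLOPs value means a cheaper model)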

Results

| # | Model | GFLOPs | Extra Data | Paper | Date | Code |
|---|-------|--------|------------|-------|------|------|
| 1 | Base (DeiT-S) | 4.6 | No | Training data-efficient image transformers & dis... | 2020-12-23 | Code |
| 2 | EViT (90%) | 4 | No | Not All Patches are What You Need: Expediting Vi... | 2022-02-16 | Code |
| 3 | DynamicViT (90%) | 4 | No | DynamicViT: Efficient Vision Transformers with D... | 2021-06-03 | Code |
| 4 | SPViT (3.9G) | 3.9 | No | SPViT: Enabling Faster Vision Transformers via S... | 2021-12-27 | Code |
| 5 | LTMP (80%) | 3.8 | No | Learned Thresholds Token Merging and Pruning for... | 2023-07-20 | Code |
| 6 | A-ViT | 3.6 | No | AdaViT: Adaptive Tokens for Efficient Vision Tra... | 2021-12-14 | Code |
| 7 | EViT (80%) | 3.5 | No | Not All Patches are What You Need: Expediting Vi... | 2022-02-16 | Code |
| 8 | DynamicViT (80%) | 3.4 | No | DynamicViT: Efficient Vision Transformers with D... | 2021-06-03 | Code |
| 9 | ToMe ($r=8$) | 3.4 | No | Token Merging: Your ViT But Faster | 2022-10-17 | Code |
| 10 | SPViT | 3.3 | No | Pruning Self-attentions into Convolutional Layer... | 2021-11-23 | Code |
| 11 | S$^2$ViTE | 3.2 | No | Chasing Sparsity in Vision Transformers: An End-... | 2021-06-08 | Code |
| 12 | IA-RED$^2$ | 3.2 | No | IA-RED$^2$: Interpretability-Aware Redundancy Re... | 2021-06-23 | - |
| 13 | dTPS | 3 | No | Joint Token Pruning and Squeezing Towards More A... | 2023-04-21 | Code |
| 14 | eTPS | 3 | No | Joint Token Pruning and Squeezing Towards More A... | 2023-04-21 | Code |
| 15 | BAT (70%) | 3 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
| 16 | AS-DeiT-S (65%) | 3 | No | Adaptive Sparse ViT: Towards Learnable Adaptive ... | 2022-09-28 | Code |
| 17 | LTMP (60%) | 3 | No | Learned Thresholds Token Merging and Pruning for... | 2023-07-20 | Code |
| 18 | EViT (70%) | 3 | No | Not All Patches are What You Need: Expediting Vi... | 2022-02-16 | Code |
| 19 | EvoViT | 3 | No | Evo-ViT: Slow-Fast Token Evolution for Dynamic V... | 2021-08-03 | Code |
| 20 | DiffRate | 2.9 | No | DiffRate : Differentiable Compression Rate for E... | 2023-05-29 | Code |
| 21 | PPT | 2.9 | No | PPT: Token Pruning and Pooling for Efficient Vis... | 2023-10-03 | Code |
| 22 | ATS | 2.9 | No | Adaptive Token Sampling For Efficient Vision Tra... | 2021-11-30 | Code |
| 23 | DynamicViT (70%) | 2.9 | No | DynamicViT: Efficient Vision Transformers with D... | 2021-06-03 | Code |
| 24 | ToMe ($r=13$) | 2.7 | No | Token Merging: Your ViT But Faster | 2022-10-17 | Code |
| 25 | HVT-S-1 | 2.7 | No | Scalable Vision Transformers with Hierarchical P... | 2021-03-19 | Code |
| 26 | MCTF ($r=16$) | 2.6 | No | Multi-criteria Token Fusion with One-step-ahead ... | 2024-03-15 | Code |
| 27 | PS-ViT | 2.6 | No | Patch Slimming for Efficient Vision Transformers | 2021-06-05 | - |
| 28 | SPViT (2.6G) | 2.6 | No | SPViT: Enabling Faster Vision Transformers via S... | 2021-12-27 | Code |
| 29 | BAT (60%) | 2.6 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
| 30 | EViT (60%) | 2.6 | No | Not All Patches are What You Need: Expediting Vi... | 2022-02-16 | Code |
| 31 | MCTF ($r=18$) | 2.4 | No | Multi-criteria Token Fusion with One-step-ahead ... | 2024-03-15 | Code |
| 32 | DPS-ViT | 2.4 | No | Patch Slimming for Efficient Vision Transformers | 2021-06-05 | - |
| 33 | ToMe ($r=16$) | 2.3 | No | Token Merging: Your ViT But Faster | 2022-10-17 | Code |
| 34 | BAT (50%) | 2.3 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
| 35 | AS-DeiT-S (50%) | 2.3 | No | Adaptive Sparse ViT: Towards Learnable Adaptive ... | 2022-09-28 | Code |
| 36 | LTMP (45%) | 2.3 | No | Learned Thresholds Token Merging and Pruning for... | 2023-07-20 | Code |
| 37 | EViT (50%) | 2.3 | No | Not All Patches are What You Need: Expediting Vi... | 2022-02-16 | Code |
| 38 | MCTF ($r=20$) | 2.2 | No | Multi-criteria Token Fusion with One-step-ahead ... | 2024-03-15 | Code |
| 39 | BAT (40%) | 2 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
| 40 | BAT (30%) | 1.8 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
| 41 | BAT (20%) | 1.6 | No | Beyond Attentive Tokens: Incorporating Token Imp... | 2022-11-21 | Code |
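
Because every entry is a variant of DeiT-S, the GFLOPs column can be read relative to the 4.6 GFLOPs baseline in row 1. The sketch below is illustrative only and not part of the leaderboard: the function name `flops_reduction` and the choice of three sample rows are assumptions, and it says nothing about accuracy, which this table does not report.

```python
# Baseline cost taken from row 1 of the table: Base (DeiT-S).
BASELINE_GFLOPS = 4.6

# A few rows copied from the leaderboard (model name -> GFLOPs).
SAMPLE_ROWS = {
    "EViT (70%)": 3.0,
    "ToMe ($r=13$)": 2.7,
    "BAT (20%)": 1.6,
}


def flops_reduction(gflops: float, baseline: float = BASELINE_GFLOPS) -> float:
    """Return the fractional compute saving relative to the baseline.

    0.0 means the same cost as the baseline; 0.5 means half the cost.
    """
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return 1.0 - gflops / baseline


if __name__ == "__main__":
    for name, gflops in SAMPLE_ROWS.items():
        saving = flops_reduction(gflops)
        print(f"{name}: {gflops} GFLOPs, {saving:.1%} below the DeiT-S baseline")
```

Read this way, a row at 2.3 GFLOPs uses half the compute of the unmodified DeiT-S model.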