ViT-B-VTN (3 layers, ImageNet pretrain)
Reported on 2 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision2 results
- Acc@1· 2021-02-0178.6best: 93.6 (OmniVec2)
- Acc@5· 2021-02-0193.7best: 98.9 (TubeViT-H (ImageNet-1k))