Swin-L (384x384, ImageNet-21k pretrain)
Reported on 4 benchmarks across 1 task · 1 paper · 2 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision4 results
- Acc@5· 2021-06-24SOTA96.7best: 98.9 (TubeViT-H (ImageNet-1k))
- Top-5 Accuracy· uses extra data· 2021-06-24SOTA97.3best: 98.9 (TubeVit-H)
- Acc@1· 2021-06-2484.9best: 93.6 (OmniVec2)
- Top-1 Accuracy· uses extra data· 2021-06-2486.1best: 91.9 (InternVideo2-6B)