MaskFeat (no extra data, MViT-L)
Reported on 6 benchmarks across 1 task · 1 paper · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision6 results
- Top-1 Accuracy· 2021-12-16SOTA80.4best: 85.9 (InternVideo2-6B)
- Top-5 Accuracy· 2021-12-16SOTA95.7best: 96.7 (UMT-L (ViT-L/16))
- Top-1 Accuracy· 2021-12-16SOTA88.3best: 91.9 (InternVideo2-6B)
- Top-5 Accuracy· 2021-12-16SOTA98best: 98.9 (TubeVit-H)
- Acc@1· 2021-12-1686.7best: 93.6 (OmniVec2)
- Acc@5· 2021-12-1697.3best: 98.9 (TubeViT-H (ImageNet-1k))