X-CLIP(ViT-L/14, CLIP)
Reported on 4 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision · 4 results
- Acc@1 · 2022-08-04 · 87.7 · best: 93.6 (OmniVec2)
- Acc@5 · 2022-08-04 · 97.4 · best: 98.9 (TubeViT-H (ImageNet-1k))
- Top-1 Accuracy · uses extra data · 2022-08-04 · 88.3 · best: 91.9 (InternVideo2-6B)
- Top-5 Accuracy · uses extra data · 2022-08-04 · 97.7 · best: 98.9 (TubeVit-H)