CLIP(ViT-L/14-336px)
Reported on 1 benchmark across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Computer Vision · 1 result
- Accuracy (Private) · uses extra data · 2021-02-26 · 76.2 · best: 88.5 (M2-Encoder)