CLIP (ViT-L/14)
Reported on 2 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Reasoning · 2 results
- Image Score · 2023-11-17 · 8 · best: 68.75 (GPT-4V (CoT, pick b/w two options))
- Text Score · 2023-11-17 · 30.25 · best: 75.5 (GPT-4o + CA)