Image-to-Text Retrieval on COCO

Metric: Recall@1 (higher is better)
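
The leaderboard does not spell out how Recall@1 is computed. As a rough sketch, assuming the common definition for image-to-text retrieval (an image counts as a hit when its top-ranked caption is one of its own ground-truth captions), the metric could be computed as below. The function name `recall_at_k`, the argument names, and the toy scores are illustrative assumptions, not taken from the leaderboard or the SigLIP paper.

```python
from typing import Sequence


def recall_at_k(
    similarity: Sequence[Sequence[float]],
    caption_to_image: Sequence[int],
    k: int = 1,
) -> float:
    """Fraction of images with at least one ground-truth caption in the top k.

    similarity[i][j] is the score between image i and caption j.
    caption_to_image[j] is the index of the image that caption j describes.
    (Both names are hypothetical, chosen for this sketch.)
    """
    hits = 0
    for image_idx, scores in enumerate(similarity):
        # Rank caption indices by descending similarity to this image.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        # Count a hit if any of the top-k captions belongs to this image.
        if any(caption_to_image[j] == image_idx for j in ranked[:k]):
            hits += 1
    return hits / len(similarity)


# Toy data (made up): 2 images, 4 captions, two captions per image.
toy_similarity = [
    [0.9, 0.2, 0.4, 0.1],  # image 0: top caption is caption 0, one of its own
    [0.3, 0.8, 0.5, 0.6],  # image 1: top caption is caption 1, which belongs to image 0
]
toy_caption_to_image = [0, 0, 1, 1]
toy_r1 = recall_at_k(toy_similarity, toy_caption_to_image, k=1)
```

On the toy data only image 0 retrieves one of its own captions first, so `toy_r1` is 0.5. Leaderboards typically report this fraction as a percentage, which is presumably how the score in the table should be read.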

# | Model | Recall@1 | Extra Data | Paper | Date | Code
1 | SigLIP (ViT-L, zero-shot) | 70.6 | No | Sigmoid Loss for Language Image Pre-Training | 2023-03-27 | Code