Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image-to-Text Retrieval on COCO (Common Objects in Context)

Metric: Recall@1 (higher is better)
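
For readers unfamiliar with the metric, the following is a minimal sketch of how Recall@K is commonly computed for image-to-text retrieval: each image is a query, all candidate captions are ranked by similarity score, and the query counts as a hit if any of its ground-truth captions appears in the top K. The leaderboard does not state the exact evaluation protocol each paper used, so treat this as an illustration under that assumption. The function name `recall_at_k`, its parameters, and the toy data are hypothetical and do not come from any of the listed papers.

```python
from typing import Sequence, Set


def recall_at_k(
    similarity: Sequence[Sequence[float]],
    relevant: Sequence[Set[int]],
    k: int = 1,
) -> float:
    """Percentage of image queries with at least one correct caption in the top k.

    similarity[i][j] is the score between image i and caption j (hypothetical layout).
    relevant[i] is the set of caption indices that describe image i.
    """
    if len(similarity) == 0:
        raise ValueError("similarity must contain at least one image query")
    if len(similarity) != len(relevant):
        raise ValueError("similarity and relevant must have the same length")

    hits = 0
    for scores, gold in zip(similarity, relevant):
        # Rank caption indices from highest to lowest similarity.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        # The query is a hit if any ground-truth caption is among the top k.
        if gold.intersection(ranked[:k]):
            hits += 1
    return 100.0 * hits / len(similarity)


# Toy example (made-up scores): 2 images, 4 captions, 2 captions per image.
toy_similarity = [
    [0.9, 0.1, 0.3, 0.2],  # image 0: best match is caption 0 (correct)
    [0.2, 0.8, 0.1, 0.7],  # image 1: best match is caption 1 (incorrect)
]
toy_relevant = [{0, 1}, {2, 3}]

print(recall_at_k(toy_similarity, toy_relevant, k=1))  # 50.0
print(recall_at_k(toy_similarity, toy_relevant, k=2))  # 100.0
```

In the toy data, image 1 only retrieves a correct caption at rank 2, so it counts as a miss for Recall@1 and a hit for Recall@2.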

Results

| # | Model | Recall@1 | Extra Data | Paper | Date | Code |
|---|-------|----------|------------|-------|------|------|
| 1 | BLIP-2 (ViT-G, fine-tuned) | 85.4 | No | BLIP-2: Bootstrapping Language-Image Pre-trainin... | 2023-01-30 | Code |
| 2 | ONE-PEACE (ViT-G, w/o ranking) | 84.1 | No | ONE-PEACE: Exploring One General Representation ... | 2023-05-18 | Code |
| 3 | BLIP-2 (ViT-L, fine-tuned) | 83.5 | No | BLIP-2: Bootstrapping Language-Image Pre-trainin... | 2023-01-30 | Code |
| 4 | IAIS | 67.78 | No | Learning Relation Alignment for Calibrated Cross... | 2021-05-28 | Code |
| 5 | CLIP (zero-shot) | 58.4 | No | Learning Transferable Visual Models From Natural... | 2021-02-26 | Code |
| 6 | FLAVA (ViT-B, zero-shot) | 42.74 | No | FLAVA: A Foundational Language And Vision Alignm... | 2021-12-08 | Code |