Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image-to-Text Retrieval on COCO (Common Objects in Context)

Metric: Recall@5 (higher is better)
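As a rough illustration of the metric, the sketch below computes Recall@K for image-to-text retrieval under the common convention that a query image counts as a hit when at least one of its ground-truth captions appears among the K highest-scoring captions. The function name `recall_at_k`, the similarity matrix layout, and the toy data are assumptions made for this example; they are not taken from the leaderboard or from any listed paper's evaluation code.

```python
from typing import Sequence, Set


def recall_at_k(similarity: Sequence[Sequence[float]],
                relevant: Sequence[Set[int]],
                k: int = 5) -> float:
    """Percentage of query images with at least one relevant caption in the top k.

    similarity[i][j] is the (hypothetical) model score between image i and caption j.
    relevant[i] is the set of caption indices that describe image i.
    """
    hits = 0
    for scores, gold in zip(similarity, relevant):
        # Rank caption indices by descending similarity score.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        # A hit requires any ground-truth caption among the first k ranks.
        if gold & set(ranked[:k]):
            hits += 1
    return 100.0 * hits / len(similarity)


# Toy data (made up): 2 images, 6 candidate captions.
similarity = [
    [0.9, 0.1, 0.2, 0.3, 0.4, 0.5],  # image 0: its caption (index 0) ranks first
    [0.6, 0.5, 0.4, 0.3, 0.2, 0.1],  # image 1: its caption (index 5) ranks last
]
relevant = [{0}, {5}]
score = recall_at_k(similarity, relevant, k=5)
print(score)
```

In this toy case one of the two images retrieves its caption within the top 5, so the score is 50.0. The leaderboard values are reported on the same 0 to 100 scale.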

Results

| # | Model | Recall@5 | Extra Data | Paper | Date | Code |
|---|-------|----------|------------|-------|------|------|
| 1 | BLIP-2 (ViT-G, fine-tuned) | 97 | No | BLIP-2: Bootstrapping Language-Image Pre-trainin... | 2023-01-30 | Code |
| 2 | ONE-PEACE (ViT-G, w/o ranking) | 96.3 | No | ONE-PEACE: Exploring One General Representation ... | 2023-05-18 | Code |
| 3 | BLIP-2 (ViT-L, fine-tuned) | 96 | No | BLIP-2: Bootstrapping Language-Image Pre-trainin... | 2023-01-30 | Code |
| 4 | IAIS | 89.7 | No | Learning Relation Alignment for Calibrated Cross... | 2021-05-28 | Code |
| 5 | CLIP (zero-shot) | 81.5 | No | Learning Transferable Visual Models From Natural... | 2021-02-26 | Code |
| 6 | FLAVA (ViT-B, zero-shot) | 76.76 | No | FLAVA: A Foundational Language And Vision Alignm... | 2021-12-08 | Code |