TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image-to-Text Retrieval/COCO (Common Objects in Context)

Image-to-Text Retrieval on COCO (Common Objects in Context)

Metric: Recall@10 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Recall@10▼Extra DataPaperDate↕Code
1Oscar99.8NoOscar: Object-Semantics Aligned Pre-training for...2020-04-13Code
2BLIP-2 (ViT-G, fine-tuned)98.5NoBLIP-2: Bootstrapping Language-Image Pre-trainin...2023-01-30Code
3ONE-PEACE (ViT-G, w/o ranking)98.3NoONE-PEACE: Exploring One General Representation ...2023-05-18Code
4BLIP-2 (ViT-L, fine-tuned)98NoBLIP-2: Bootstrapping Language-Image Pre-trainin...2023-01-30Code
5Unicoder-VL97.2NoUnicoder-VL: A Universal Encoder for Vision and ...2019-08-16-
6IAIS94.48NoLearning Relation Alignment for Calibrated Cross...2021-05-28Code
7CLIP (zero-shot)88.1NoLearning Transferable Visual Models From Natural...2021-02-26Code
8DVSA74.8NoDeep Visual-Semantic Alignments for Generating I...2014-12-07Code