TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Retrieval/Flickr30k

Image Retrieval on Flickr30k

Metric: Recall@5 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Recall@5▼Extra DataPaperDate↕Code
1BLIP-2 ViT-G (zero-shot, 1K test set)98.1NoBLIP-2: Bootstrapping Language-Image Pre-trainin...2023-01-30Code
2BLIP-2 ViT-L (zero-shot, 1K test set)97.6NoBLIP-2: Bootstrapping Language-Image Pre-trainin...2023-01-30Code
3MaMMUT (ours)96NoMaMMUT: A Simple Architecture for Joint Learning...2023-03-29Code
4HADA95.94NoHADA: A Graph-based Amalgamation Framework in Im...2023-01-11Code
5ALBEF95.3NoHADA: A Graph-based Amalgamation Framework in Im...2023-01-11Code
6UNITER94.08NoHADA: A Graph-based Amalgamation Framework in Im...2023-01-11Code
7LGSGM84.1NoA Deep Local and Global Scene-Graph Matching for...2021-06-04Code
8GSMN82.3NoGraph Structured Network for Image-Text Matching2020-04-01Code
9VisualSparta82NoVisualSparta: An Embarrassingly Simple Approach ...2021-01-01Code