Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Retrieval on Flickr30K 1K test

Metric: R@5 (higher is better)
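
As a rough illustration of the metric, R@5 (Recall@5) is commonly computed as the percentage of queries whose correct item appears among the five highest-scoring candidates. The sketch below assumes each query has exactly one ground-truth item and that a similarity matrix is already available; the function name `recall_at_k` and its arguments are hypothetical and are not taken from any of the listed papers or from this leaderboard's evaluation code.

```python
def recall_at_k(similarity, gt_index, k=5):
    """Return the percentage of queries whose ground-truth item is in the top k.

    similarity: list of rows, one per query; similarity[i][j] is the score of
                candidate j for query i (higher means more similar).
    gt_index:   list with the index of the correct candidate for each query.
    Assumes exactly one correct candidate per query (an illustrative simplification).
    """
    hits = 0
    for scores, gt in zip(similarity, gt_index):
        # Rank candidate indices from highest to lowest score.
        ranked = sorted(range(len(scores)), key=lambda j: scores[j], reverse=True)
        if gt in ranked[:k]:
            hits += 1
    return 100.0 * hits / len(gt_index)


# Toy example: 3 queries, 3 candidates each.
sim = [
    [0.9, 0.1, 0.3],
    [0.2, 0.8, 0.5],
    [0.7, 0.6, 0.1],
]
gt = [0, 2, 2]
print(recall_at_k(sim, gt, k=1))
print(recall_at_k(sim, gt, k=2))
```

Scores on this scale range from 0 to 100, so a higher value means more queries retrieved their correct item within the top five.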

Results

| # | Model | R@5 | Extra Data | Paper | Date | Code |
|---|-------|-----|------------|-------|------|------|
| 1 | X-VLM (base) | 97.3 | Yes | Multi-Grained Vision Language Pre-Training: Alig... | 2021-11-16 | Code |
| 2 | RCAR | 85.8 | No | Plug-and-Play Regulators for Image-Text Matching | 2023-03-23 | Code |
| 3 | LGSGM | 84.1 | No | A Deep Local and Global Scene-Graph Matching for... | 2021-06-04 | Code |
| 4 | TERAN Symm. | 83.1 | No | Fine-grained Visual Textual Alignment for Cross-... | 2020-08-12 | Code |
| 5 | SGRAF | 83 | No | Similarity Reasoning and Filtration for Image-Te... | 2021-01-05 | Code |
| 6 | VisualSparta | 82 | No | VisualSparta: An Embarrassingly Simple Approach ... | 2021-01-01 | Code |
| 7 | VSRN | 81.8 | No | Visual Semantic Reasoning for Image-Text Matching | 2019-09-06 | Code |
| 8 | TERAN MrSw | 81.2 | No | Fine-grained Visual Textual Alignment for Cross-... | 2020-08-12 | Code |
| 9 | CAMP | 77.1 | No | CAMP: Cross-Modal Adaptive Message Passing for T... | 2019-09-12 | Code |
| 10 | SCAN i-t | 74.2 | No | Stacked Cross Attention for Image-Text Matching | 2018-03-21 | Code |
| 11 | SCO | 70.5 | No | Learning Semantic Concepts and Order for Image a... | 2017-12-06 | - |
| 12 | DAN | 69.2 | No | Dual Attention Networks for Multimodal Reasoning... | 2016-11-02 | Code |
| 13 | SPE | 60.1 | No | Learning Deep Structure-Preserving Image-Text Em... | 2015-11-19 | - |
| 14 | mCNN | 56.3 | No | Multimodal Convolutional Neural Networks for Mat... | 2015-04-23 | Code |
| 15 | HGLMM FV | 53.4 | No | Flickr30k Entities: Collecting Region-to-Phrase ... | 2015-05-19 | Code |