TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Retrieval/Flickr30K 1K test

Image Retrieval on Flickr30K 1K test

Metric: R@1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕R@1▼Extra DataPaperDate↕Code
1X-VLM (base)86.9YesMulti-Grained Vision Language Pre-Training: Alig...2021-11-16Code
2RCAR62.6NoPlug-and-Play Regulators for Image-Text Matching2023-03-23Code
3SGRAF58.5NoSimilarity Reasoning and Filtration for Image-Te...2021-01-05Code
4LGSGM57.4NoA Deep Local and Global Scene-Graph Matching for...2021-06-04Code
5VisualSparta57.4NoVisualSparta: An Embarrassingly Simple Approach ...2021-01-01Code
6TERAN MrSw56.5NoFine-grained Visual Textual Alignment for Cross-...2020-08-12Code
7TERAN Symm.55.7NoFine-grained Visual Textual Alignment for Cross-...2020-08-12Code
8VSRN54.7NoVisual Semantic Reasoning for Image-Text Matching2019-09-06Code
9CAMP51.5NoCAMP: Cross-Modal Adaptive Message Passing for T...2019-09-12Code
10SCAN i-t44NoStacked Cross Attention for Image-Text Matching2018-03-21Code
11SCO41.1NoLearning Semantic Concepts and Order for Image a...2017-12-06-
12DAN39.4NoDual Attention Networks for Multimodal Reasoning...2016-11-02Code
132WayNet (VGG)36NoLinking Image and Text with 2-Way Nets2016-08-29Code
14SM-LSTM (VGG)30.2NoInstance-aware Image and Sentence Matching with ...2016-11-17-
15SPE29.7NoLearning Deep Structure-Preserving Image-Text Em...2015-11-19-
16mCNN26.2NoMultimodal Convolutional Neural Networks for Mat...2015-04-23Code
17HGLMM FV24.7NoFlickr30k Entities: Collecting Region-to-Phrase ...2015-05-19Code
18DVSA (R-CNN, AlexNet)15.2NoDeep Visual-Semantic Alignments for Generating I...2014-12-07Code