Image Retrieval with Multi-Modal Query on Recipe1M+

Metric: Image-to-text R@1 (higher is better)

LeaderboardDataset
Loading chart...
#ModelImage-to-text R@1Extra DataPaperDateCode
1VLPCook45.2NoVision and Structured-Language Pretraining for C...2022-12-08Code
2Marin et al.17NoRecipe1M+: A Dataset for Learning Cross-Modal Em...2018-10-14-