TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FooDI-ML: a large multi-language dataset of food, drinks a...

FooDI-ML: a large multi-language dataset of food, drinks and groceries images and descriptions

David Amat Olóndriz, Ponç Palau Puigdevall, Adrià Salvador Palau

2021-10-05RetrievalImage GenerationConditional Image GenerationImage Retrieval
PaperPDFCode(official)

Abstract

In this paper we introduce the FooDI-ML dataset. This dataset contains over 1.5M unique images and over 9.5M store names, product names descriptions, and collection sections gathered from the Glovo application. The data made available corresponds to food, drinks and groceries products from 37 countries in Europe, the Middle East, Africa and Latin America. The dataset comprehends 33 languages, including 870K samples of languages of countries from Eastern Europe and Western Asia such as Ukrainian and Kazakh, which have been so far underrepresented in publicly available visio-linguistic datasets. The dataset also includes widely spoken languages such as Spanish and English. To assist further research, we include benchmarks over two tasks: text-image retrieval and conditional image generation.

Results

TaskDatasetMetricValueModel
Image RetrievalFooDI-ML (Spain)A-R@10.93ADAPT-I2T
Image RetrievalFooDI-ML (Spain)A-R@105.8ADAPT-I2T
Image RetrievalFooDI-ML (Spain)A-R@53.33ADAPT-I2T
Image RetrievalFooDI-ML (Spain)Re-R@10.73ADAPT-I2T
Image RetrievalFooDI-ML (Spain)Re-R@105.67ADAPT-I2T
Image RetrievalFooDI-ML (Spain)Re-R@52.93ADAPT-I2T
Image RetrievalFooDI-ML (Global)A-R@10.005ADAPT-I2T
Image RetrievalFooDI-ML (Global)A-R@100.05ADAPT-I2T
Image RetrievalFooDI-ML (Global)A-R@50.02ADAPT-I2T
Image RetrievalFooDI-ML (Global)Re-R@10.01ADAPT-I2T
Image RetrievalFooDI-ML (Global)Re-R@100.045ADAPT-I2T
Image RetrievalFooDI-ML (Global)Re-R@50.03ADAPT-I2T

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17A Survey of Context Engineering for Large Language Models2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17