TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LLaVA-Chef: A Multi-modal Generative Model for Food Recipes

LLaVA-Chef: A Multi-modal Generative Model for Food Recipes

Fnu Mohbat, Mohammed J. Zaki

2024-08-29Recipe Generation
PaperPDFCode(official)

Abstract

In the rapidly evolving landscape of online recipe sharing within a globalized context, there has been a notable surge in research towards comprehending and generating food recipes. Recent advancements in large language models (LLMs) like GPT-2 and LLaVA have paved the way for Natural Language Processing (NLP) approaches to delve deeper into various facets of food-related tasks, encompassing ingredient recognition and comprehensive recipe generation. Despite impressive performance and multi-modal adaptability of LLMs, domain-specific training remains paramount for their effective application. This work evaluates existing LLMs for recipe generation and proposes LLaVA-Chef, a novel model trained on a curated dataset of diverse recipe prompts in a multi-stage approach. First, we refine the mapping of visual food image embeddings to the language space. Second, we adapt LLaVA to the food domain by fine-tuning it on relevant recipe data. Third, we utilize diverse prompts to enhance the model's recipe comprehension. Finally, we improve the linguistic quality of generated recipes by penalizing the model with a custom loss function. LLaVA-Chef demonstrates impressive improvements over pretrained LLMs and prior works. A detailed qualitative analysis reveals that LLaVA-Chef generates more detailed recipes with precise ingredient mentions, compared to existing approaches.

Results

TaskDatasetMetricValueModel
Recipe Generationallrecipes.comBLEU6LLaVA-Chef
Recipe Generationallrecipes.comPerplexity2.6LLaVA-Chef
Recipe GenerationFood.comBLEU-129LLaVA-Chef
Recipe GenerationFood.comBLEU-46LLaVA-Chef
Recipe GenerationFood.comBPE Perplexity2.6LLaVA-Chef
Recipe GenerationFood.comRouge-L18.4LLaVA-Chef
Recipe GenerationNow You're Cooking!Perplexity2.6LLaVA-Chef

Related Papers

RecipeGen: A Step-Aligned Multimodal Benchmark for Real-World Recipe Generation2025-06-07Few-Shot Test-Time Optimization Without Retraining for Semiconductor Recipe Generation and Beyond2025-05-21KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models2025-05-20Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study2025-02-04Retrieval Augmented Recipe Generation2024-11-13Culinary Class Wars: Evaluating LLMs using ASH in Cuisine Transfer Task2024-11-04FoodMLLM-JP: Leveraging Multimodal Large Language Models for Japanese Recipe Generation2024-09-27Instruction Following without Instruction Tuning2024-09-21