TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bayesian Prompt Learning for Image-Language Model Generali...

Bayesian Prompt Learning for Image-Language Model Generalization

Mohammad Mahdi Derakhshani, Enrique Sanchez, Adrian Bulat, Victor Guilherme Turrisi da Costa, Cees G. M. Snoek, Georgios Tzimiropoulos, Brais Martinez

2022-10-05ICCV 2023 1Few-Shot LearningPrompt EngineeringMultimodal Deep LearningLanguage ModellingVariational Inference
PaperPDFCode(official)

Abstract

Foundational image-language models have generated considerable interest due to their efficient adaptation to downstream tasks by prompt learning. Prompt learning treats part of the language model input as trainable while freezing the rest, and optimizes an Empirical Risk Minimization objective. However, Empirical Risk Minimization is known to suffer from distributional shifts which hurt generalizability to prompts unseen during training. By leveraging the regularization ability of Bayesian methods, we frame prompt learning from the Bayesian perspective and formulate it as a variational inference problem. Our approach regularizes the prompt space, reduces overfitting to the seen prompts and improves the prompt generalization on unseen prompts. Our framework is implemented by modeling the input prompt space in a probabilistic manner, as an a priori distribution which makes our proposal compatible with prompt learning approaches that are unconditional or conditional on the image. We demonstrate empirically on 15 benchmarks that Bayesian prompt learning provides an appropriate coverage of the prompt space, prevents learning spurious features, and exploits transferable invariant features. This results in better generalization of unseen prompts, even across different datasets and domains. Code available at: https://github.com/saic-fi/Bayesian-Prompt-Learning

Results

TaskDatasetMetricValueModel
Few-Shot LearningStanforCarsHarmonic mean73.07Variational Prompt Tuning
Few-Shot LearningFGVC AircraftHarmonic mean34.69Variational Prompt Tuning
Few-Shot Learningfood101Harmonic mean91.57Variational Prompt Tuning
Few-Shot LearningFlowers-102Harmonic mean81.12Variational Prompt Tuning
Few-Shot LearningEuroSATHarmonic mean77.71Variational Prompt Tuning
Few-Shot LearningUCF101Harmonic mean79Variational Prompt Tuning
Few-Shot LearningOxfordPetsHarmonic mean96.82Variational Prompt Tuning
Few-Shot LearningSUN397Harmonic mean78.51Variational Prompt Tuning
Few-Shot LearningCaltech101Harmonic mean96.44Variational Prompt Tuning
Few-Shot LearningDTDHarmonic mean67.27Variational Prompt Tuning
Meta-LearningStanforCarsHarmonic mean73.07Variational Prompt Tuning
Meta-LearningFGVC AircraftHarmonic mean34.69Variational Prompt Tuning
Meta-Learningfood101Harmonic mean91.57Variational Prompt Tuning
Meta-LearningFlowers-102Harmonic mean81.12Variational Prompt Tuning
Meta-LearningEuroSATHarmonic mean77.71Variational Prompt Tuning
Meta-LearningUCF101Harmonic mean79Variational Prompt Tuning
Meta-LearningOxfordPetsHarmonic mean96.82Variational Prompt Tuning
Meta-LearningSUN397Harmonic mean78.51Variational Prompt Tuning
Meta-LearningCaltech101Harmonic mean96.44Variational Prompt Tuning
Meta-LearningDTDHarmonic mean67.27Variational Prompt Tuning

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Leveraging Language Prior for Infrared Small Target Detection2025-07-17Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17