TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Read-only Prompt Optimization for Vision-Language Few-shot...

Read-only Prompt Optimization for Vision-Language Few-shot Learning

Dongjun Lee, Seokwon Song, Jihee Suh, Joonmyung Choi, Sanghyeok Lee, Hyunwoo J. Kim

2023-08-29ICCV 2023 1Few-Shot LearningPrompt EngineeringDomain Generalization
PaperPDFCode(official)

Abstract

In recent years, prompt tuning has proven effective in adapting pre-trained vision-language models to downstream tasks. These methods aim to adapt the pre-trained models by introducing learnable prompts while keeping pre-trained weights frozen. However, learnable prompts can affect the internal representation within the self-attention module, which may negatively impact performance variance and generalization, especially in data-deficient settings. To address these issues, we propose a novel approach, Read-only Prompt Optimization (RPO). RPO leverages masked attention to prevent the internal representation shift in the pre-trained model. Further, to facilitate the optimization of RPO, the read-only prompts are initialized based on special tokens of the pre-trained model. Our extensive experiments demonstrate that RPO outperforms CLIP and CoCoOp in base-to-new generalization and domain generalization while displaying better robustness. Also, the proposed method achieves better generalization on extremely data-deficient settings, while improving parameter efficiency and computational overhead. Code is available at https://github.com/mlvlab/RPO.

Results

TaskDatasetMetricValueModel
Prompt EngineeringStanford CarsHarmonic mean74.69RPO
Prompt EngineeringOxford 102 FlowerHarmonic mean84.5RPO
Prompt EngineeringEuroSATHarmonic mean76.79RPO
Prompt EngineeringOxford-IIIT Pet DatasetHarmonic mean96.05RPO
Prompt EngineeringDTDHarmonic mean68.61RPO
Prompt EngineeringUCF101Harmonic mean79.34RPO
Prompt EngineeringFood-101Harmonic mean90.58RPO
Prompt EngineeringCaltech-101Harmonic mean96.03RPO
Prompt EngineeringImageNetHarmonic mean74RPO
Prompt EngineeringFGVC-AircraftHarmonic mean35.7RPO
Prompt EngineeringSUN397Harmonic mean79.18RPO

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Leveraging Language Prior for Infrared Small Target Detection2025-07-17Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing2025-07-16Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges2025-07-13From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion2025-07-11