TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Delving into Out-of-Distribution Detection with Vision-Lan...

Delving into Out-of-Distribution Detection with Vision-Language Representations

Yifei Ming, Ziyang Cai, Jiuxiang Gu, Yiyou Sun, Wei Li, Yixuan Li

2022-11-24Out-of-Distribution Detection
PaperPDFCodeCode(official)

Abstract

Recognizing out-of-distribution (OOD) samples is critical for machine learning systems deployed in the open world. The vast majority of OOD detection methods are driven by a single modality (e.g., either vision or language), leaving the rich information in multi-modal representations untapped. Inspired by the recent success of vision-language pre-training, this paper enriches the landscape of OOD detection from a single-modal to a multi-modal regime. Particularly, we propose Maximum Concept Matching (MCM), a simple yet effective zero-shot OOD detection method based on aligning visual features with textual concepts. We contribute in-depth analysis and theoretical insights to understand the effectiveness of MCM. Extensive experiments demonstrate that MCM achieves superior performance on a wide variety of real-world tasks. MCM with vision-language features outperforms a common baseline with pure visual features on a hard OOD task with semantically similar classes by 13.1% (AUROC). Code is available at https://github.com/deeplearning-wisc/MCM.

Results

TaskDatasetMetricValueModel
Out-of-Distribution DetectionImageNet-1k vs iNaturalistAUROC94.95MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs iNaturalistFPR9528.38MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs iNaturalistAUROC94.61MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs iNaturalistFPR9530.91MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs TexturesAUROC86.11MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs TexturesFPR9557.77MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs TexturesAUROC84.88MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs TexturesFPR9559.88MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs PlacesAUROC92MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs PlacesFPR9535.42MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs PlacesAUROC89.77MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs PlacesFPR9544.69MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs SUNAUROC94.14MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs SUNFPR9529MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs SUNAUROC92.57MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs SUNFPR9537.59MCM (CLIP-B)
Out-of-Distribution DetectionImageNet-1k vs Curated OODs (avg.)AUROC91.49MCM (CLIP-L)
Out-of-Distribution DetectionImageNet-1k vs Curated OODs (avg.)FPR9538.17MCM (CLIP-L)

Related Papers

Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation2025-07-08FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection2025-07-06Out-of-distribution detection in 3D applications: a review2025-07-01Generative Adversarial Evasion and Out-of-Distribution Detection for UAV Cyber-Attacks2025-06-26Enclosing Prototypical Variational Autoencoder for Explainable Out-of-Distribution Detection2025-06-17A Variational Information Theoretic Approach to Out-of-Distribution Detection2025-06-17DIsoN: Decentralized Isolation Networks for Out-of-Distribution Detection in Medical Imaging2025-06-10EviNet: Evidential Reasoning Network for Resilient Graph Learning in the Open and Noisy Environments2025-06-08