Towards Recognizing Unseen Categories in Unseen Domains

Massimiliano Mancini, Zeynep Akata, Elisa Ricci, Barbara Caputo

2020-07-23ECCV 2020 8Zero-Shot Learning + Domain Generalization Domain Generalization Zero-Shot Learning

Abstract

Current deep visual recognition systems suffer from severe performance degradation when they encounter new images from classes and scenarios unseen during training. Hence, the core challenge of Zero-Shot Learning (ZSL) is to cope with the semantic-shift whereas the main challenge of Domain Adaptation and Domain Generalization (DG) is the domain-shift. While historically ZSL and DG tasks are tackled in isolation, this work develops with the ambitious goal of solving them jointly,i.e. by recognizing unseen visual concepts in unseen domains. We presentCuMix (CurriculumMixup for recognizing unseen categories in unseen domains), a holistic algorithm to tackle ZSL, DG and ZSL+DG. The key idea of CuMix is to simulate the test-time domain and semantic shift using images and features from unseen domains and categories generated by mixing up the multiple source domains and categories available during training. Moreover, a curriculum-based mixing policy is devised to generate increasingly complex training samples. Results on standard SL and DG datasets and on ZSL+DG using the DomainNet benchmark demonstrate the effectiveness of our approach.

Results

Task	Dataset	Metric	Value	Model
Domain Adaptation	PACS	Average Accuracy	81.6	CuMix (Resnet-18)
Domain Generalization	PACS	Average Accuracy	81.6	CuMix (Resnet-18)

Related Papers

Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17 GLAD: Generalizable Tuning for Vision-Language Models2025-07-17 MoTM: Towards a Foundation Model for Time Series Imputation based on Continuous Modeling2025-07-17 InstructFLIP: Exploring Unified Vision-Language Model for Face Anti-spoofing2025-07-16 DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation2025-07-14 From Physics to Foundation Models: A Review of AI-Driven Quantitative Remote Sensing Inversion2025-07-11 Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion2025-07-08 Prompt-Free Conditional Diffusion for Multi-object Image Augmentation2025-07-08