
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Semi-Supervised Domain Generalization for Object Detection via Language-Guided Feature Alignment

Sina Malakouti, Adriana Kovashka

Date: 2023-09-24
Venue: BMVC 2023 11
Tags: Descriptive, Domain Generalization, object-detection, Object Detection, Domain Adaptation

Abstract

Existing domain adaptation (DA) and domain generalization (DG) methods in object detection enforce feature alignment in the visual space but face challenges like object appearance variability and scene complexity, which make it difficult to distinguish between objects and achieve accurate detection. In this paper, we are the first to address the problem of semi-supervised domain generalization by exploring vision-language pre-training and enforcing feature alignment through the language space. We employ a novel Cross-Domain Descriptive Multi-Scale Learning (CDDMSL) approach, which aims to maximize the agreement, in the embedding space, between descriptions of an image presented with different domain-specific characteristics. CDDMSL significantly outperforms existing methods, achieving improvements of 11.7% and 7.5% in the DG and DA settings, respectively. Comprehensive analysis and ablation studies confirm the effectiveness of our method, positioning CDDMSL as a promising approach for domain generalization in object detection tasks.
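
The abstract does not state the loss used to "maximize the agreement" between descriptions. One common way to express such an objective is a contrastive (InfoNCE-style) loss over paired embeddings, and the sketch below illustrates that idea only. It is an assumption for illustration, not the CDDMSL objective: the function names, the temperature value, and the toy 2-D vectors are all hypothetical, and real description embeddings would come from a vision-language model.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def agreement_loss(emb_a, emb_b, temperature=0.1):
    """InfoNCE-style loss (hypothetical stand-in, not the paper's loss).

    emb_a[i] and emb_b[i] are assumed to be description embeddings of the
    same image rendered with two different domain-specific appearances.
    The loss is low when each emb_a[i] is most similar to its own emb_b[i].
    """
    n = len(emb_a)
    total = 0.0
    for i in range(n):
        # Similarity of item i in domain A to every item in domain B.
        logits = [cosine(emb_a[i], emb_b[j]) / temperature for j in range(n)]
        # Numerically stable log-sum-exp over the candidates.
        m = max(logits)
        log_denominator = m + math.log(sum(math.exp(x - m) for x in logits))
        # Negative log-probability of picking the matching pair.
        total += log_denominator - logits[i]
    return total / n


# Toy data: two "images", each embedded once per domain.
aligned = [[1.0, 0.0], [0.0, 1.0]]
shuffled = [[0.0, 1.0], [1.0, 0.0]]

loss_aligned = agreement_loss(aligned, aligned)    # matching pairs agree
loss_shuffled = agreement_loss(aligned, shuffled)  # matching pairs disagree
print(loss_aligned < loss_shuffled)
```

In this toy setup the loss is smaller when the paired embeddings agree across domains, which is the behavior a training objective of this kind would reward.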

Results

Task              Dataset                          Metric  Value  Model
Object Detection  PASCAL VOC to Watercolor2k       mAP     49.7   CDDMSL
Object Detection  BDD100K                          mAP     27.1   CDDMSL
Object Detection  Watercolor2k                     mAP     49.8   CDDMSL
Object Detection  Comic2k                          mAP     45.9   CDDMSL
Object Detection  PASCAL VOC to Comic2k            mAP     46.3   CDDMSL
Object Detection  Pascal VOC to Clipart1K          mAP     40.4   CDDMSL
Object Detection  Cityscapes to Foggy Cityscapes   mAP     54.3   CDDMSL
Object Detection  Clipart1k                        mAP     39.8   CDDMSL

The same eight results are also filed under the task labels 3D, 2D Classification, 2D Object Detection, and 16k.
