TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Synthesizing the Unseen for Zero-shot Object Detection

Synthesizing the Unseen for Zero-shot Object Detection

Nasir Hayat, Munawar Hayat, Shafin Rahman, Salman Khan, Syed Waqas Zamir, Fahad Shahbaz Khan

2020-10-19Zero-Shot Object DetectionGeneralized Zero-Shot Object DetectionObject Detection
PaperPDFCode(official)Code

Abstract

The existing zero-shot detection approaches project visual features to the semantic domain for seen objects, hoping to map unseen objects to their corresponding semantics during inference. However, since the unseen objects are never visualized during training, the detection model is skewed towards seen content, thereby labeling unseen as background or a seen class. In this work, we propose to synthesize visual features for unseen classes, so that the model learns both seen and unseen objects in the visual domain. Consequently, the major challenge becomes, how to accurately synthesize unseen objects merely using their class semantics? Towards this ambitious goal, we propose a novel generative model that uses class-semantics to not only generate the features but also to discriminatively separate them. Further, using a unified model, we ensure the synthesized features have high diversity that represents the intra-class differences and variable localization precision in the detected bounding boxes. We test our approach on three object detection benchmarks, PASCAL VOC, MSCOCO, and ILSVRC detection, under both conventional and generalized settings, showing impressive gains over the state-of-the-art methods. Our codes are available at https://github.com/nasir6/zero_shot_detection.

Results

TaskDatasetMetricValueModel
Object DetectionImageNet DetectionmAP24.3SUZOD
Object DetectionMS-COCORecall61.4SUZOD
Object DetectionMS-COCOmAP17.3SUZOD
Object DetectionPASCAL VOC'07mAP64.9SUZOD
3DImageNet DetectionmAP24.3SUZOD
3DMS-COCORecall61.4SUZOD
3DMS-COCOmAP17.3SUZOD
3DPASCAL VOC'07mAP64.9SUZOD
2D ClassificationImageNet DetectionmAP24.3SUZOD
2D ClassificationMS-COCORecall61.4SUZOD
2D ClassificationMS-COCOmAP17.3SUZOD
2D ClassificationPASCAL VOC'07mAP64.9SUZOD
2D Object DetectionImageNet DetectionmAP24.3SUZOD
2D Object DetectionMS-COCORecall61.4SUZOD
2D Object DetectionMS-COCOmAP17.3SUZOD
2D Object DetectionPASCAL VOC'07mAP64.9SUZOD
16kImageNet DetectionmAP24.3SUZOD
16kMS-COCORecall61.4SUZOD
16kMS-COCOmAP17.3SUZOD
16kPASCAL VOC'07mAP64.9SUZOD

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07