TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Resolving Semantic Confusions for Improved Zero-Shot Detec...

Resolving Semantic Confusions for Improved Zero-Shot Detection

Sandipan Sarma, Sushil Kumar, Arijit Sur

2022-12-12British Machine Vision Conference 2022 11Transfer LearningZero-Shot Object DetectionGeneralized Zero-Shot Object DetectionZero-Shot LearningObject Detection
PaperPDFCode(official)

Abstract

Zero-shot detection (ZSD) is a challenging task where we aim to recognize and localize objects simultaneously, even when our model has not been trained with visual samples of a few target ("unseen") classes. Recently, methods employing generative models like GANs have shown some of the best results, where unseen-class samples are generated based on their semantics by a GAN trained on seen-class data, enabling vanilla object detectors to recognize unseen objects. However, the problem of semantic confusion still remains, where the model is sometimes unable to distinguish between semantically-similar classes. In this work, we propose to train a generative model incorporating a triplet loss that acknowledges the degree of dissimilarity between classes and reflects them in the generated samples. Moreover, a cyclic-consistency loss is also enforced to ensure that generated visual samples of a class highly correspond to their own semantics. Extensive experiments on two benchmark ZSD datasets - MSCOCO and PASCAL-VOC - demonstrate significant gains over the current ZSD methods, reducing semantic confusion and improving detection for the unseen classes.

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCORecall65.1ZSD-SCR
Object DetectionMS-COCOmAP20.1ZSD-SCR
Object DetectionPASCAL VOC'07mAP62.7ZSD-SCR
3DMS-COCORecall65.1ZSD-SCR
3DMS-COCOmAP20.1ZSD-SCR
3DPASCAL VOC'07mAP62.7ZSD-SCR
2D ClassificationMS-COCORecall65.1ZSD-SCR
2D ClassificationMS-COCOmAP20.1ZSD-SCR
2D ClassificationPASCAL VOC'07mAP62.7ZSD-SCR
2D Object DetectionMS-COCORecall65.1ZSD-SCR
2D Object DetectionMS-COCOmAP20.1ZSD-SCR
2D Object DetectionPASCAL VOC'07mAP62.7ZSD-SCR
16kMS-COCORecall65.1ZSD-SCR
16kMS-COCOmAP20.1ZSD-SCR
16kPASCAL VOC'07mAP62.7ZSD-SCR

Related Papers

RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17GLAD: Generalizable Tuning for Vision-Language Models2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16