Resolving Semantic Confusions for Improved Zero-Shot Detection

Sandipan Sarma, Sushil Kumar, Arijit Sur

2022-12-12British Machine Vision Conference 2022 11Transfer Learning Zero-Shot Object Detection Generalized Zero-Shot Object Detection Zero-Shot Learning Object Detection

Paper PDF Code(official)

Abstract

Zero-shot detection (ZSD) is a challenging task where we aim to recognize and localize objects simultaneously, even when our model has not been trained with visual samples of a few target ("unseen") classes. Recently, methods employing generative models like GANs have shown some of the best results, where unseen-class samples are generated based on their semantics by a GAN trained on seen-class data, enabling vanilla object detectors to recognize unseen objects. However, the problem of semantic confusion still remains, where the model is sometimes unable to distinguish between semantically-similar classes. In this work, we propose to train a generative model incorporating a triplet loss that acknowledges the degree of dissimilarity between classes and reflects them in the generated samples. Moreover, a cyclic-consistency loss is also enforced to ensure that generated visual samples of a class highly correspond to their own semantics. Extensive experiments on two benchmark ZSD datasets - MSCOCO and PASCAL-VOC - demonstrate significant gains over the current ZSD methods, reducing semantic confusion and improving detection for the unseen classes.

Results

Task	Dataset	Metric	Value	Model
Object Detection	MS-COCO	Recall	65.1	ZSD-SCR
Object Detection	MS-COCO	mAP	20.1	ZSD-SCR
Object Detection	PASCAL VOC'07	mAP	62.7	ZSD-SCR
3D	MS-COCO	Recall	65.1	ZSD-SCR
3D	MS-COCO	mAP	20.1	ZSD-SCR
3D	PASCAL VOC'07	mAP	62.7	ZSD-SCR
2D Classification	MS-COCO	Recall	65.1	ZSD-SCR
2D Classification	MS-COCO	mAP	20.1	ZSD-SCR
2D Classification	PASCAL VOC'07	mAP	62.7	ZSD-SCR
2D Object Detection	MS-COCO	Recall	65.1	ZSD-SCR
2D Object Detection	MS-COCO	mAP	20.1	ZSD-SCR
2D Object Detection	PASCAL VOC'07	mAP	62.7	ZSD-SCR
16k	MS-COCO	Recall	65.1	ZSD-SCR
16k	MS-COCO	mAP	20.1	ZSD-SCR
16k	PASCAL VOC'07	mAP	62.7	ZSD-SCR

Resolving Semantic Confusions for Improved Zero-Shot Detection

Abstract

Results

Related Papers

Resolving Semantic Confusions for Improved Zero-Shot Detection

Abstract

Results

Related Papers