TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FSCE: Few-Shot Object Detection via Contrastive Proposal E...

FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding

Bo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang

2021-03-10CVPR 2021 1Few-Shot LearningFew-Shot Object DetectionImage AugmentationContrastive Learningobject-detectionCross-Domain Few-Shot Object DetectionObject Detection
PaperPDFCode(official)Code

Abstract

Emerging interests have been brought to recognize previously unseen objects given very few training examples, known as few-shot object detection (FSOD). Recent researches demonstrate that good feature embedding is the key to reach favorable few-shot learning performance. We observe object proposals with different Intersection-of-Union (IoU) scores are analogous to the intra-image augmentation used in contrastive approaches. And we exploit this analogy and incorporate supervised contrastive learning to achieve more robust objects representations in FSOD. We present Few-Shot object detection via Contrastive proposals Encoding (FSCE), a simple yet effective approach to learning contrastive-aware object proposal encodings that facilitate the classification of detected objects. We notice the degradation of average precision (AP) for rare objects mainly comes from misclassifying novel instances as confusable classes. And we ease the misclassification issues by promoting instance level intra-class compactness and inter-class variance via our contrastive proposal encoding loss (CPE loss). Our design outperforms current state-of-the-art works in any shot and all data splits, with up to +8.8% on standard benchmark PASCAL VOC and +2.7% on challenging COCO benchmark. Code is available at: https: //github.com/MegviiDetection/FSCE

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCO (30-shot)AP15.3FSCE
Object DetectionMS-COCO (10-shot)AP11.1FSCE
Object DetectionArtaxor mAP15.9FSCE
Object DetectionDIORmAP21.9FSCE
Object DetectionUODDmAP12FSCE
3DMS-COCO (30-shot)AP15.3FSCE
3DMS-COCO (10-shot)AP11.1FSCE
3DArtaxor mAP15.9FSCE
3DDIORmAP21.9FSCE
3DUODDmAP12FSCE
Few-Shot Object DetectionMS-COCO (30-shot)AP15.3FSCE
Few-Shot Object DetectionMS-COCO (10-shot)AP11.1FSCE
Few-Shot Object DetectionArtaxor mAP15.9FSCE
Few-Shot Object DetectionDIORmAP21.9FSCE
Few-Shot Object DetectionUODDmAP12FSCE
2D ClassificationMS-COCO (30-shot)AP15.3FSCE
2D ClassificationMS-COCO (10-shot)AP11.1FSCE
2D ClassificationArtaxor mAP15.9FSCE
2D ClassificationDIORmAP21.9FSCE
2D ClassificationUODDmAP12FSCE
2D Object DetectionMS-COCO (30-shot)AP15.3FSCE
2D Object DetectionMS-COCO (10-shot)AP11.1FSCE
2D Object DetectionArtaxor mAP15.9FSCE
2D Object DetectionDIORmAP21.9FSCE
2D Object DetectionUODDmAP12FSCE
16kMS-COCO (30-shot)AP15.3FSCE
16kMS-COCO (10-shot)AP11.1FSCE
16kArtaxor mAP15.9FSCE
16kDIORmAP21.9FSCE
16kUODDmAP12FSCE

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17