TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Frustratingly Simple Few-Shot Object Detection

Frustratingly Simple Few-Shot Object Detection

Xin Wang, Thomas E. Huang, Trevor Darrell, Joseph E. Gonzalez, Fisher Yu

2020-03-16ICML 2020 1Meta-LearningFew-Shot Object Detectionobject-detectionCross-Domain Few-Shot Object DetectionObject Detection
PaperPDFCode(official)CodeCodeCodeCode

Abstract

Detecting rare objects from a few examples is an emerging problem. Prior works show meta-learning is a promising approach. But, fine-tuning techniques have drawn scant attention. We find that fine-tuning only the last layer of existing detectors on rare classes is crucial to the few-shot object detection task. Such a simple approach outperforms the meta-learning methods by roughly 2~20 points on current benchmarks and sometimes even doubles the accuracy of the prior methods. However, the high variance in the few samples often leads to the unreliability of existing benchmarks. We revise the evaluation protocols by sampling multiple groups of training examples to obtain stable comparisons and build new benchmarks based on three datasets: PASCAL VOC, COCO and LVIS. Again, our fine-tuning approach establishes a new state of the art on the revised benchmarks. The code as well as the pretrained models are available at https://github.com/ucbdrive/few-shot-object-detection.

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCO (30-shot)AP13.7TFA w/ cos
Object DetectionMS-COCO (30-shot)AP13.4TFA w/ fc
Object DetectionMS-COCO (10-shot)AP10TFA(w/cos)
Object DetectionMS-COCO (10-shot)AP10TFA(w/fc)
Object DetectionArtaxor mAP14.8TFA w/cos
Object DetectionDIORmAP20.5TFA w/cos
Object DetectionUODDmAP11.8TFA w/cos
3DMS-COCO (30-shot)AP13.7TFA w/ cos
3DMS-COCO (30-shot)AP13.4TFA w/ fc
3DMS-COCO (10-shot)AP10TFA(w/cos)
3DMS-COCO (10-shot)AP10TFA(w/fc)
3DArtaxor mAP14.8TFA w/cos
3DDIORmAP20.5TFA w/cos
3DUODDmAP11.8TFA w/cos
Few-Shot Object DetectionMS-COCO (30-shot)AP13.7TFA w/ cos
Few-Shot Object DetectionMS-COCO (30-shot)AP13.4TFA w/ fc
Few-Shot Object DetectionMS-COCO (10-shot)AP10TFA(w/cos)
Few-Shot Object DetectionMS-COCO (10-shot)AP10TFA(w/fc)
Few-Shot Object DetectionArtaxor mAP14.8TFA w/cos
Few-Shot Object DetectionDIORmAP20.5TFA w/cos
Few-Shot Object DetectionUODDmAP11.8TFA w/cos
2D ClassificationMS-COCO (30-shot)AP13.7TFA w/ cos
2D ClassificationMS-COCO (30-shot)AP13.4TFA w/ fc
2D ClassificationMS-COCO (10-shot)AP10TFA(w/cos)
2D ClassificationMS-COCO (10-shot)AP10TFA(w/fc)
2D ClassificationArtaxor mAP14.8TFA w/cos
2D ClassificationDIORmAP20.5TFA w/cos
2D ClassificationUODDmAP11.8TFA w/cos
2D Object DetectionMS-COCO (30-shot)AP13.7TFA w/ cos
2D Object DetectionMS-COCO (30-shot)AP13.4TFA w/ fc
2D Object DetectionMS-COCO (10-shot)AP10TFA(w/cos)
2D Object DetectionMS-COCO (10-shot)AP10TFA(w/fc)
2D Object DetectionArtaxor mAP14.8TFA w/cos
2D Object DetectionDIORmAP20.5TFA w/cos
2D Object DetectionUODDmAP11.8TFA w/cos
16kMS-COCO (30-shot)AP13.7TFA w/ cos
16kMS-COCO (30-shot)AP13.4TFA w/ fc
16kMS-COCO (10-shot)AP10TFA(w/cos)
16kMS-COCO (10-shot)AP10TFA(w/fc)
16kArtaxor mAP14.8TFA w/cos
16kDIORmAP20.5TFA w/cos
16kUODDmAP11.8TFA w/cos

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16