TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Meta-DETR: Image-Level Few-Shot Object Detection with Inte...

Meta-DETR: Image-Level Few-Shot Object Detection with Inter-Class Correlation Exploitation

Gongjie Zhang, Zhipeng Luo, Kaiwen Cui, Shijian Lu

2021-03-22Meta-LearningFew-Shot Object DetectionObject Localizationobject-detectionObject Detection
PaperPDFCode(official)Code

Abstract

Few-shot object detection has been extensively investigated by incorporating meta-learning into region-based detection frameworks. Despite its success, the said paradigm is constrained by several factors, such as (i) low-quality region proposals for novel classes and (ii) negligence of the inter-class correlation among different classes. Such limitations hinder the generalization of base-class knowledge for the detection of novel-class objects. In this work, we design Meta-DETR, a novel few-shot detection framework that incorporates correlational aggregation for meta-learning into DETR detection frameworks. Meta-DETR works entirely at image level without any region proposals, which circumvents the constraint of inaccurate proposals in prevalent few-shot detection frameworks. Besides, Meta-DETR can simultaneously attend to multiple support classes within a single feed-forward. This unique design allows capturing the inter-class correlation among different classes, which significantly reduces the misclassification of similar classes and enhances knowledge generalization to novel classes. Experiments over multiple few-shot object detection benchmarks show that the proposed Meta-DETR outperforms state-of-the-art methods by large margins. The implementation codes will be released at https://github.com/ZhangGongjie/Meta-DETR.

Results

TaskDatasetMetricValueModel
Object DetectionMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
Object DetectionMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
Object DetectionMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
Object DetectionMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)
3DMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
3DMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
3DMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
3DMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)
Few-Shot Object DetectionMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
Few-Shot Object DetectionMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
Few-Shot Object DetectionMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
Few-Shot Object DetectionMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)
2D ClassificationMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
2D ClassificationMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
2D ClassificationMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
2D ClassificationMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)
2D Object DetectionMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
2D Object DetectionMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
2D Object DetectionMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
2D Object DetectionMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)
16kMS-COCO (30-shot)AP22.9Meta-DETR (Multi-Scale Feature)
16kMS-COCO (30-shot)AP21.3Meta-DETR (Single-Scale Feature)
16kMS-COCO (10-shot)AP17.8Meta-DETR (Multi-Scale Feature)
16kMS-COCO (10-shot)AP16.7Meta-DETR (Single-Scale Feature)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16