Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

DETReg: Unsupervised Pretraining with Region Priors for Object Detection

Amir Bar, Xin Wang, Vadim Kantorov, Colorado J Reed, Roei Herzig, Gal Chechik, Anna Rohrbach, Trevor Darrell, Amir Globerson

2021-06-08 | CVPR 2022
Tasks: Few-Shot Learning, Few-Shot Object Detection, Region Proposal, Object Localization, Object Detection, Semi-Supervised Object Detection, Unsupervised Instance Segmentation

Abstract

Recent self-supervised pretraining methods for object detection largely focus on pretraining the backbone of the object detector, neglecting key parts of detection architecture. Instead, we introduce DETReg, a new self-supervised method that pretrains the entire object detection network, including the object localization and embedding components. During pretraining, DETReg predicts object localizations to match the localizations from an unsupervised region proposal generator and simultaneously aligns the corresponding feature embeddings with embeddings from a self-supervised image encoder. We implement DETReg using the DETR family of detectors and show that it improves over competitive baselines when finetuned on COCO, PASCAL VOC, and Airbus Ship benchmarks. In low-data regimes DETReg achieves improved performance, e.g., when training with only 1% of the labels and in the few-shot learning settings.
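The pretraining objective described in the abstract can be illustrated with a toy sketch: predicted boxes are matched one-to-one to boxes from a region proposal generator, and each matched prediction's embedding is pulled toward the embedding a self-supervised encoder produces for that region. The abstract does not state the matching procedure, the loss functions, or their weights, so everything below is an assumption: the L1 costs, the brute-force matching (DETR-family detectors typically use Hungarian matching), and the hypothetical names `match_predictions`, `pretraining_loss`, `w_box`, and `w_emb`.

```python
from itertools import permutations


def l1(a, b):
    """Sum of absolute differences between two equal-length vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def match_predictions(pred_boxes, proposal_boxes):
    """Assign each proposal to a distinct prediction, minimizing total L1 box cost.

    Brute force over assignments, for illustration only (assumed, not from the
    paper); a real detector would use an efficient Hungarian solver.
    Returns a list where entry j is the prediction index matched to proposal j.
    """
    assert len(proposal_boxes) <= len(pred_boxes)
    best_cost, best = float("inf"), ()
    for perm in permutations(range(len(pred_boxes)), len(proposal_boxes)):
        cost = sum(l1(pred_boxes[p], proposal_boxes[j]) for j, p in enumerate(perm))
        if cost < best_cost:
            best_cost, best = cost, perm
    return list(best)


def pretraining_loss(pred_boxes, pred_embs, proposal_boxes, target_embs,
                     w_box=1.0, w_emb=1.0):
    """Weighted localization + embedding-alignment loss, averaged over proposals.

    The loss forms and weights are hypothetical placeholders.
    """
    matched = match_predictions(pred_boxes, proposal_boxes)
    box_loss = sum(l1(pred_boxes[p], proposal_boxes[j]) for j, p in enumerate(matched))
    emb_loss = sum(l1(pred_embs[p], target_embs[j]) for j, p in enumerate(matched))
    return (w_box * box_loss + w_emb * emb_loss) / max(len(proposal_boxes), 1)


# Toy data: two predictions (cx, cy, w, h), one pseudo-ground-truth proposal.
pred_boxes = [[0.5, 0.5, 0.2, 0.2], [0.1, 0.1, 0.1, 0.1]]
pred_embs = [[1.0, 0.0], [0.0, 1.0]]
proposal_boxes = [[0.1, 0.1, 0.1, 0.1]]
target_embs = [[0.0, 0.5]]  # stand-in for a self-supervised encoder's output

print(match_predictions(pred_boxes, proposal_boxes))
print(pretraining_loss(pred_boxes, pred_embs, proposal_boxes, target_embs))
```

In this toy case the second prediction coincides with the proposal, so the box term vanishes and only the embedding mismatch contributes to the loss.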

Results

Task | Dataset | Metric | Value | Model
Object Detection | PASCAL VOC 10% | AP | 51.4 | DETReg (ours)
Object Detection | PASCAL VOC 10% | AP50 | 72.2 | DETReg (ours)
Object Detection | PASCAL VOC 10% | AP75 | 56.6 | DETReg (ours)
Object Detection | COCO 2017 | AP | 30 | DETReg (ours)
Object Detection | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
Object Detection | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
3D | PASCAL VOC 10% | AP | 51.4 | DETReg (ours)
3D | PASCAL VOC 10% | AP50 | 72.2 | DETReg (ours)
3D | PASCAL VOC 10% | AP75 | 56.6 | DETReg (ours)
3D | COCO 2017 | AP | 30 | DETReg (ours)
3D | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
3D | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
Few-Shot Object Detection | COCO 2017 | AP | 30 | DETReg (ours)
Few-Shot Object Detection | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
Few-Shot Object Detection | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
2D Classification | PASCAL VOC 10% | AP | 51.4 | DETReg (ours)
2D Classification | PASCAL VOC 10% | AP50 | 72.2 | DETReg (ours)
2D Classification | PASCAL VOC 10% | AP75 | 56.6 | DETReg (ours)
2D Classification | COCO 2017 | AP | 30 | DETReg (ours)
2D Classification | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
2D Classification | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
2D Object Detection | PASCAL VOC 10% | AP | 51.4 | DETReg (ours)
2D Object Detection | PASCAL VOC 10% | AP50 | 72.2 | DETReg (ours)
2D Object Detection | PASCAL VOC 10% | AP75 | 56.6 | DETReg (ours)
2D Object Detection | COCO 2017 | AP | 30 | DETReg (ours)
2D Object Detection | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
2D Object Detection | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
Unsupervised Instance Segmentation | COCO val2017 | AP | 3.3 | DETReg
Unsupervised Instance Segmentation | COCO val2017 | AP50 | 8.8 | DETReg
Unsupervised Instance Segmentation | COCO val2017 | AP75 | 1.9 | DETReg
16k | PASCAL VOC 10% | AP | 51.4 | DETReg (ours)
16k | PASCAL VOC 10% | AP50 | 72.2 | DETReg (ours)
16k | PASCAL VOC 10% | AP75 | 56.6 | DETReg (ours)
16k | COCO 2017 | AP | 30 | DETReg (ours)
16k | MS-COCO (30-shot) | AP | 30 | DETReg-ft-full DDETR
16k | MS-COCO (10-shot) | AP | 25 | DETReg-ft-full DDETR
