TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DN-DETR: Accelerate DETR Training by Introducing Query DeN...

DN-DETR: Accelerate DETR Training by Introducing Query DeNoising

Feng Li, Hao Zhang, Shilong Liu, Jian Guo, Lionel M. Ni, Lei Zhang

2022-03-02CVPR 2022 1Object Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCode(official)CodeCode(official)CodeCodeCodeCode

Abstract

We present in this paper a novel denoising training method to speedup DETR (DEtection TRansformer) training and offer a deepened understanding of the slow convergence issue of DETR-like methods. We show that the slow convergence results from the instability of bipartite graph matching which causes inconsistent optimization goals in early training stages. To address this issue, except for the Hungarian loss, our method additionally feeds ground-truth bounding boxes with noises into Transformer decoder and trains the model to reconstruct the original boxes, which effectively reduces the bipartite graph matching difficulty and leads to a faster convergence. Our method is universal and can be easily plugged into any DETR-like methods by adding dozens of lines of code to achieve a remarkable improvement. As a result, our DN-DETR results in a remarkable improvement ($+1.9$AP) under the same setting and achieves the best result (AP $43.4$ and $48.6$ with $12$ and $50$ epochs of training respectively) among DETR-like methods with ResNet-$50$ backbone. Compared with the baseline under the same setting, DN-DETR achieves comparable performance with $50\%$ training epochs. Code is available at \url{https://github.com/FengLi-ust/DN-DETR}.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5067.6DN-Deformable-DETR-R50++
Object DetectionCOCO minivalAP7553.8DN-Deformable-DETR-R50++
Object DetectionCOCO minivalAPL65.4DN-Deformable-DETR-R50++
Object DetectionCOCO minivalAPM52.6DN-Deformable-DETR-R50++
Object DetectionCOCO minivalAPS31.3DN-Deformable-DETR-R50++
Object DetectionCOCO minivalParams (M)47DN-Deformable-DETR-R50++
Object DetectionCOCO minivalbox AP49.5DN-Deformable-DETR-R50++
3DCOCO minivalAP5067.6DN-Deformable-DETR-R50++
3DCOCO minivalAP7553.8DN-Deformable-DETR-R50++
3DCOCO minivalAPL65.4DN-Deformable-DETR-R50++
3DCOCO minivalAPM52.6DN-Deformable-DETR-R50++
3DCOCO minivalAPS31.3DN-Deformable-DETR-R50++
3DCOCO minivalParams (M)47DN-Deformable-DETR-R50++
3DCOCO minivalbox AP49.5DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalAP5067.6DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalAP7553.8DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalAPL65.4DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalAPM52.6DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalAPS31.3DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalParams (M)47DN-Deformable-DETR-R50++
2D ClassificationCOCO minivalbox AP49.5DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalAP5067.6DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalAP7553.8DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalAPL65.4DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalAPM52.6DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalAPS31.3DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalParams (M)47DN-Deformable-DETR-R50++
2D Object DetectionCOCO minivalbox AP49.5DN-Deformable-DETR-R50++
16kCOCO minivalAP5067.6DN-Deformable-DETR-R50++
16kCOCO minivalAP7553.8DN-Deformable-DETR-R50++
16kCOCO minivalAPL65.4DN-Deformable-DETR-R50++
16kCOCO minivalAPM52.6DN-Deformable-DETR-R50++
16kCOCO minivalAPS31.3DN-Deformable-DETR-R50++
16kCOCO minivalParams (M)47DN-Deformable-DETR-R50++
16kCOCO minivalbox AP49.5DN-Deformable-DETR-R50++

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07