TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Conditional DETR for Fast Training Convergence

Conditional DETR for Fast Training Convergence

Depu Meng, Xiaokang Chen, Zejia Fan, Gang Zeng, Houqiang Li, Yuhui Yuan, Lei Sun, Jingdong Wang

2021-08-13ICCV 2021 10object-detectionObject Detection
PaperPDFCodeCodeCodeCode(official)

Abstract

The recently-developed DETR approach applies the transformer encoder and decoder architecture to object detection and achieves promising performance. In this paper, we handle the critical issue, slow training convergence, and present a conditional cross-attention mechanism for fast DETR training. Our approach is motivated by that the cross-attention in DETR relies highly on the content embeddings for localizing the four extremities and predicting the box, which increases the need for high-quality content embeddings and thus the training difficulty. Our approach, named conditional DETR, learns a conditional spatial query from the decoder embedding for decoder multi-head cross-attention. The benefit is that through the conditional spatial query, each cross-attention head is able to attend to a band containing a distinct region, e.g., one object extremity or a region inside the object box. This narrows down the spatial range for localizing the distinct regions for object classification and box regression, thus relaxing the dependence on the content embeddings and easing the training. Empirical results show that conditional DETR converges 6.7x faster for the backbones R50 and R101 and 10x faster for stronger backbones DC5-R50 and DC5-R101. Code is available at https://github.com/Atten4Vis/ConditionalDETR.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5066.8Conditional DETR-DC5-R101
Object DetectionCOCO minivalAP7549.5Conditional DETR-DC5-R101
Object DetectionCOCO minivalAPL63.3Conditional DETR-DC5-R101
Object DetectionCOCO minivalAPM50.3Conditional DETR-DC5-R101
Object DetectionCOCO minivalAPS27.2Conditional DETR-DC5-R101
Object DetectionCOCO minivalParams (M)63Conditional DETR-DC5-R101
Object DetectionCOCO minivalbox AP45.9Conditional DETR-DC5-R101
Object DetectionCOCO minivalAP5065.4Conditional DETR-DC5-R50
Object DetectionCOCO minivalAP7548.5Conditional DETR-DC5-R50
Object DetectionCOCO minivalAPL62.2Conditional DETR-DC5-R50
Object DetectionCOCO minivalAPM49Conditional DETR-DC5-R50
Object DetectionCOCO minivalAPS25.3Conditional DETR-DC5-R50
Object DetectionCOCO minivalParams (M)44Conditional DETR-DC5-R50
Object DetectionCOCO minivalbox AP45.1Conditional DETR-DC5-R50
Object DetectionCOCO minivalAP5065.6Conditional DETR-R101
Object DetectionCOCO minivalAP7547.5Conditional DETR-R101
Object DetectionCOCO minivalAPL63.6Conditional DETR-R101
Object DetectionCOCO minivalAPM48.4Conditional DETR-R101
Object DetectionCOCO minivalAPS23.6Conditional DETR-R101
Object DetectionCOCO minivalParams (M)63Conditional DETR-R101
Object DetectionCOCO minivalbox AP44.5Conditional DETR-R101
Object DetectionCOCO minivalAP5064Conditional DETR-R50
Object DetectionCOCO minivalAP7545.7Conditional DETR-R50
Object DetectionCOCO minivalAPL61.5Conditional DETR-R50
Object DetectionCOCO minivalAPM46.7Conditional DETR-R50
Object DetectionCOCO minivalAPS22.7Conditional DETR-R50
Object DetectionCOCO minivalParams (M)44Conditional DETR-R50
Object DetectionCOCO minivalbox AP43Conditional DETR-R50
3DCOCO minivalAP5066.8Conditional DETR-DC5-R101
3DCOCO minivalAP7549.5Conditional DETR-DC5-R101
3DCOCO minivalAPL63.3Conditional DETR-DC5-R101
3DCOCO minivalAPM50.3Conditional DETR-DC5-R101
3DCOCO minivalAPS27.2Conditional DETR-DC5-R101
3DCOCO minivalParams (M)63Conditional DETR-DC5-R101
3DCOCO minivalbox AP45.9Conditional DETR-DC5-R101
3DCOCO minivalAP5065.4Conditional DETR-DC5-R50
3DCOCO minivalAP7548.5Conditional DETR-DC5-R50
3DCOCO minivalAPL62.2Conditional DETR-DC5-R50
3DCOCO minivalAPM49Conditional DETR-DC5-R50
3DCOCO minivalAPS25.3Conditional DETR-DC5-R50
3DCOCO minivalParams (M)44Conditional DETR-DC5-R50
3DCOCO minivalbox AP45.1Conditional DETR-DC5-R50
3DCOCO minivalAP5065.6Conditional DETR-R101
3DCOCO minivalAP7547.5Conditional DETR-R101
3DCOCO minivalAPL63.6Conditional DETR-R101
3DCOCO minivalAPM48.4Conditional DETR-R101
3DCOCO minivalAPS23.6Conditional DETR-R101
3DCOCO minivalParams (M)63Conditional DETR-R101
3DCOCO minivalbox AP44.5Conditional DETR-R101
3DCOCO minivalAP5064Conditional DETR-R50
3DCOCO minivalAP7545.7Conditional DETR-R50
3DCOCO minivalAPL61.5Conditional DETR-R50
3DCOCO minivalAPM46.7Conditional DETR-R50
3DCOCO minivalAPS22.7Conditional DETR-R50
3DCOCO minivalParams (M)44Conditional DETR-R50
3DCOCO minivalbox AP43Conditional DETR-R50
2D ClassificationCOCO minivalAP5066.8Conditional DETR-DC5-R101
2D ClassificationCOCO minivalAP7549.5Conditional DETR-DC5-R101
2D ClassificationCOCO minivalAPL63.3Conditional DETR-DC5-R101
2D ClassificationCOCO minivalAPM50.3Conditional DETR-DC5-R101
2D ClassificationCOCO minivalAPS27.2Conditional DETR-DC5-R101
2D ClassificationCOCO minivalParams (M)63Conditional DETR-DC5-R101
2D ClassificationCOCO minivalbox AP45.9Conditional DETR-DC5-R101
2D ClassificationCOCO minivalAP5065.4Conditional DETR-DC5-R50
2D ClassificationCOCO minivalAP7548.5Conditional DETR-DC5-R50
2D ClassificationCOCO minivalAPL62.2Conditional DETR-DC5-R50
2D ClassificationCOCO minivalAPM49Conditional DETR-DC5-R50
2D ClassificationCOCO minivalAPS25.3Conditional DETR-DC5-R50
2D ClassificationCOCO minivalParams (M)44Conditional DETR-DC5-R50
2D ClassificationCOCO minivalbox AP45.1Conditional DETR-DC5-R50
2D ClassificationCOCO minivalAP5065.6Conditional DETR-R101
2D ClassificationCOCO minivalAP7547.5Conditional DETR-R101
2D ClassificationCOCO minivalAPL63.6Conditional DETR-R101
2D ClassificationCOCO minivalAPM48.4Conditional DETR-R101
2D ClassificationCOCO minivalAPS23.6Conditional DETR-R101
2D ClassificationCOCO minivalParams (M)63Conditional DETR-R101
2D ClassificationCOCO minivalbox AP44.5Conditional DETR-R101
2D ClassificationCOCO minivalAP5064Conditional DETR-R50
2D ClassificationCOCO minivalAP7545.7Conditional DETR-R50
2D ClassificationCOCO minivalAPL61.5Conditional DETR-R50
2D ClassificationCOCO minivalAPM46.7Conditional DETR-R50
2D ClassificationCOCO minivalAPS22.7Conditional DETR-R50
2D ClassificationCOCO minivalParams (M)44Conditional DETR-R50
2D ClassificationCOCO minivalbox AP43Conditional DETR-R50
2D Object DetectionCOCO minivalAP5066.8Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalAP7549.5Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalAPL63.3Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalAPM50.3Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalAPS27.2Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalParams (M)63Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalbox AP45.9Conditional DETR-DC5-R101
2D Object DetectionCOCO minivalAP5065.4Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalAP7548.5Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalAPL62.2Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalAPM49Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalAPS25.3Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalParams (M)44Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalbox AP45.1Conditional DETR-DC5-R50
2D Object DetectionCOCO minivalAP5065.6Conditional DETR-R101
2D Object DetectionCOCO minivalAP7547.5Conditional DETR-R101
2D Object DetectionCOCO minivalAPL63.6Conditional DETR-R101
2D Object DetectionCOCO minivalAPM48.4Conditional DETR-R101
2D Object DetectionCOCO minivalAPS23.6Conditional DETR-R101
2D Object DetectionCOCO minivalParams (M)63Conditional DETR-R101
2D Object DetectionCOCO minivalbox AP44.5Conditional DETR-R101
2D Object DetectionCOCO minivalAP5064Conditional DETR-R50
2D Object DetectionCOCO minivalAP7545.7Conditional DETR-R50
2D Object DetectionCOCO minivalAPL61.5Conditional DETR-R50
2D Object DetectionCOCO minivalAPM46.7Conditional DETR-R50
2D Object DetectionCOCO minivalAPS22.7Conditional DETR-R50
2D Object DetectionCOCO minivalParams (M)44Conditional DETR-R50
2D Object DetectionCOCO minivalbox AP43Conditional DETR-R50
16kCOCO minivalAP5066.8Conditional DETR-DC5-R101
16kCOCO minivalAP7549.5Conditional DETR-DC5-R101
16kCOCO minivalAPL63.3Conditional DETR-DC5-R101
16kCOCO minivalAPM50.3Conditional DETR-DC5-R101
16kCOCO minivalAPS27.2Conditional DETR-DC5-R101
16kCOCO minivalParams (M)63Conditional DETR-DC5-R101
16kCOCO minivalbox AP45.9Conditional DETR-DC5-R101
16kCOCO minivalAP5065.4Conditional DETR-DC5-R50
16kCOCO minivalAP7548.5Conditional DETR-DC5-R50
16kCOCO minivalAPL62.2Conditional DETR-DC5-R50
16kCOCO minivalAPM49Conditional DETR-DC5-R50
16kCOCO minivalAPS25.3Conditional DETR-DC5-R50
16kCOCO minivalParams (M)44Conditional DETR-DC5-R50
16kCOCO minivalbox AP45.1Conditional DETR-DC5-R50
16kCOCO minivalAP5065.6Conditional DETR-R101
16kCOCO minivalAP7547.5Conditional DETR-R101
16kCOCO minivalAPL63.6Conditional DETR-R101
16kCOCO minivalAPM48.4Conditional DETR-R101
16kCOCO minivalAPS23.6Conditional DETR-R101
16kCOCO minivalParams (M)63Conditional DETR-R101
16kCOCO minivalbox AP44.5Conditional DETR-R101
16kCOCO minivalAP5064Conditional DETR-R50
16kCOCO minivalAP7545.7Conditional DETR-R50
16kCOCO minivalAPL61.5Conditional DETR-R50
16kCOCO minivalAPM46.7Conditional DETR-R50
16kCOCO minivalAPS22.7Conditional DETR-R50
16kCOCO minivalParams (M)44Conditional DETR-R50
16kCOCO minivalbox AP43Conditional DETR-R50

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07