TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Anchor DETR: Query Design for Transformer-Based Object Det...

Anchor DETR: Query Design for Transformer-Based Object Detection

Yingming Wang, Xiangyu Zhang, Tong Yang, Jian Sun

2021-09-15object-detectionObject Detection
PaperPDFCode(official)Code(official)

Abstract

In this paper, we propose a novel query design for the transformer-based object detection. In previous transformer-based detectors, the object queries are a set of learned embeddings. However, each learned embedding does not have an explicit physical meaning and we cannot explain where it will focus on. It is difficult to optimize as the prediction slot of each object query does not have a specific mode. In other words, each object query will not focus on a specific region. To solved these problems, in our query design, object queries are based on anchor points, which are widely used in CNN-based detectors. So each object query focuses on the objects near the anchor point. Moreover, our query design can predict multiple objects at one position to solve the difficulty: "one region, multiple objects". In addition, we design an attention variant, which can reduce the memory cost while achieving similar or better performance than the standard attention in DETR. Thanks to the query design and the attention variant, the proposed detector that we called Anchor DETR, can achieve better performance and run faster than the DETR with 10$\times$ fewer training epochs. For example, it achieves 44.2 AP with 19 FPS on the MSCOCO dataset when using the ResNet50-DC5 feature for training 50 epochs. Extensive experiments on the MSCOCO benchmark prove the effectiveness of the proposed methods. Code is available at \url{https://github.com/megvii-research/AnchorDETR}.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5065.7Anchor DETR-DC5-R101
Object DetectionCOCO minivalAP7548.8Anchor DETR-DC5-R101
Object DetectionCOCO minivalAPL61.6Anchor DETR-DC5-R101
Object DetectionCOCO minivalAPM49.4Anchor DETR-DC5-R101
Object DetectionCOCO minivalAPS25.8Anchor DETR-DC5-R101
Object DetectionCOCO minivalbox AP45.1Anchor DETR-DC5-R101
Object DetectionCOCO minivalAP5064.7Anchor DETR-DC5-R50
Object DetectionCOCO minivalAP7547.5Anchor DETR-DC5-R50
Object DetectionCOCO minivalAPL60.6Anchor DETR-DC5-R50
Object DetectionCOCO minivalAPM48.2Anchor DETR-DC5-R50
Object DetectionCOCO minivalAPS24.7Anchor DETR-DC5-R50
Object DetectionCOCO minivalbox AP44.2Anchor DETR-DC5-R50
3DCOCO minivalAP5065.7Anchor DETR-DC5-R101
3DCOCO minivalAP7548.8Anchor DETR-DC5-R101
3DCOCO minivalAPL61.6Anchor DETR-DC5-R101
3DCOCO minivalAPM49.4Anchor DETR-DC5-R101
3DCOCO minivalAPS25.8Anchor DETR-DC5-R101
3DCOCO minivalbox AP45.1Anchor DETR-DC5-R101
3DCOCO minivalAP5064.7Anchor DETR-DC5-R50
3DCOCO minivalAP7547.5Anchor DETR-DC5-R50
3DCOCO minivalAPL60.6Anchor DETR-DC5-R50
3DCOCO minivalAPM48.2Anchor DETR-DC5-R50
3DCOCO minivalAPS24.7Anchor DETR-DC5-R50
3DCOCO minivalbox AP44.2Anchor DETR-DC5-R50
2D ClassificationCOCO minivalAP5065.7Anchor DETR-DC5-R101
2D ClassificationCOCO minivalAP7548.8Anchor DETR-DC5-R101
2D ClassificationCOCO minivalAPL61.6Anchor DETR-DC5-R101
2D ClassificationCOCO minivalAPM49.4Anchor DETR-DC5-R101
2D ClassificationCOCO minivalAPS25.8Anchor DETR-DC5-R101
2D ClassificationCOCO minivalbox AP45.1Anchor DETR-DC5-R101
2D ClassificationCOCO minivalAP5064.7Anchor DETR-DC5-R50
2D ClassificationCOCO minivalAP7547.5Anchor DETR-DC5-R50
2D ClassificationCOCO minivalAPL60.6Anchor DETR-DC5-R50
2D ClassificationCOCO minivalAPM48.2Anchor DETR-DC5-R50
2D ClassificationCOCO minivalAPS24.7Anchor DETR-DC5-R50
2D ClassificationCOCO minivalbox AP44.2Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalAP5065.7Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalAP7548.8Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalAPL61.6Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalAPM49.4Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalAPS25.8Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalbox AP45.1Anchor DETR-DC5-R101
2D Object DetectionCOCO minivalAP5064.7Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalAP7547.5Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalAPL60.6Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalAPM48.2Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalAPS24.7Anchor DETR-DC5-R50
2D Object DetectionCOCO minivalbox AP44.2Anchor DETR-DC5-R50
16kCOCO minivalAP5065.7Anchor DETR-DC5-R101
16kCOCO minivalAP7548.8Anchor DETR-DC5-R101
16kCOCO minivalAPL61.6Anchor DETR-DC5-R101
16kCOCO minivalAPM49.4Anchor DETR-DC5-R101
16kCOCO minivalAPS25.8Anchor DETR-DC5-R101
16kCOCO minivalbox AP45.1Anchor DETR-DC5-R101
16kCOCO minivalAP5064.7Anchor DETR-DC5-R50
16kCOCO minivalAP7547.5Anchor DETR-DC5-R50
16kCOCO minivalAPL60.6Anchor DETR-DC5-R50
16kCOCO minivalAPM48.2Anchor DETR-DC5-R50
16kCOCO minivalAPS24.7Anchor DETR-DC5-R50
16kCOCO minivalbox AP44.2Anchor DETR-DC5-R50

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07