Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Relation DETR: Exploring Explicit Position Relation Prior for Object Detection

Xiuquan Hou, Meiqin Liu, Senlin Zhang, Ping Wei, Badong Chen, Xuguang Lan

Date: 2024-07-16
Tasks: 2D Object Detection, Object Detection

Abstract

This paper presents a general scheme for enhancing the convergence and performance of DETR (DEtection TRansformer). We investigate the slow convergence problem in transformers from a new perspective, suggesting that it arises from the self-attention that introduces no structural bias over inputs. To address this issue, we explore incorporating position relation prior as attention bias to augment object detection, following the verification of its statistical significance using a proposed quantitative macroscopic correlation (MC) metric. Our approach, termed Relation-DETR, introduces an encoder to construct position relation embeddings for progressive attention refinement, which further extends the traditional streaming pipeline of DETR into a contrastive relation pipeline to address the conflicts between non-duplicate predictions and positive supervision. Extensive experiments on both generic and task-specific datasets demonstrate the effectiveness of our approach. Under the same configurations, Relation-DETR achieves a significant improvement (+2.0% AP compared to DINO), state-of-the-art performance (51.7% AP for 1x and 52.1% AP for 2x settings), and a remarkably faster convergence speed (over 40% AP with only 2 training epochs) than existing DETR detectors on COCO val2017. Moreover, the proposed relation encoder serves as a universal plug-in-and-play component, bringing clear improvements for theoretically any DETR-like methods. Furthermore, we introduce a class-agnostic detection dataset, SA-Det-100k. The experimental results on the dataset illustrate that the proposed explicit position relation achieves a clear improvement of 1.3% AP, highlighting its potential towards universal object detection. The code and dataset are available at https://github.com/xiuqhou/Relation-DETR.
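The idea of using a position relation prior as an attention bias can be illustrated with a minimal sketch. It assumes boxes are given as (cx, cy, w, h), and it uses log-scaled pairwise offsets and size ratios as relation features. Both the feature formula and the `w_rel` projection vector are illustrative assumptions, not the authors' implementation; see the official repository for that.

```python
import numpy as np

def box_relation_features(boxes, eps=1e-6):
    """Pairwise relation features for boxes of shape (N, 4) as (cx, cy, w, h).

    Returns an (N, N, 4) array. The exact feature choice is an assumption
    made for illustration only.
    """
    cx, cy, w, h = (boxes[:, i] for i in range(4))
    dx = np.log(np.abs(cx[:, None] - cx[None, :]) / (w[:, None] + eps) + 1.0)
    dy = np.log(np.abs(cy[:, None] - cy[None, :]) / (h[:, None] + eps) + 1.0)
    dw = np.log((w[:, None] + eps) / (w[None, :] + eps))
    dh = np.log((h[:, None] + eps) / (h[None, :] + eps))
    return np.stack([dx, dy, dw, dh], axis=-1)

def softmax(x, axis=-1):
    # Subtract the row maximum for numerical stability.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def biased_self_attention(queries, boxes, w_rel):
    """Single-head self-attention whose logits receive an additive relation bias.

    queries: (N, D) query embeddings (also used as keys and values here).
    boxes:   (N, 4) boxes associated with the queries.
    w_rel:   (4,) hypothetical weights mapping relation features to a scalar bias.
    """
    d = queries.shape[-1]
    logits = queries @ queries.T / np.sqrt(d)        # (N, N) content term
    bias = box_relation_features(boxes) @ w_rel      # (N, N) position term
    attn = softmax(logits + bias, axis=-1)
    return attn @ queries, attn

# Usage: two nearby boxes and one distant box.
rng = np.random.default_rng(0)
queries = rng.normal(size=(3, 8))
boxes = np.array([[0.20, 0.20, 0.10, 0.10],
                  [0.25, 0.20, 0.10, 0.10],
                  [0.80, 0.80, 0.30, 0.30]])
w_rel = np.array([-1.0, -1.0, 0.0, 0.0])  # penalize distant pairs
out, attn = biased_self_attention(queries, boxes, w_rel)
```

With negative weights on the distance features, the bias lowers the attention logits between distant boxes, which is one simple way a structural prior can be injected into otherwise unbiased self-attention.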

Results

Task              Dataset        Metric      Value  Model
Object Detection  COCO test-dev  AP50        80.8   Relation-DETR (Focal-L)
Object Detection  COCO test-dev  AP75        69.1   Relation-DETR (Focal-L)
Object Detection  COCO test-dev  APL         77     Relation-DETR (Focal-L)
Object Detection  COCO test-dev  APM         66.9   Relation-DETR (Focal-L)
Object Detection  COCO test-dev  APS         47.2   Relation-DETR (Focal-L)
Object Detection  COCO test-dev  Params (M)  214    Relation-DETR (Focal-L)
Object Detection  COCO test-dev  box mAP     63.5   Relation-DETR (Focal-L)
Object Detection  COCO 2017 val  AP          58.1   Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  AP50        76.4   Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  AP75        63.5   Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  APL         73.5   Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  APM         63     Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  APS         41.8   Relation-DETR (Swin-L 2x)
Object Detection  COCO 2017 val  AP          57.8   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  AP50        76.1   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  AP75        62.9   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  APL         74.4   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  APM         62.1   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  APS         41.2   Relation-DETR (Swin-L 1x)
Object Detection  COCO 2017 val  AP          52.1   Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  AP50        69.7   Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  AP75        56.6   Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  APL         66.5   Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  APM         56     Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  APS         36.1   Relation-DETR (ResNet50 2x)
Object Detection  COCO 2017 val  AP          51.7   Relation-DETR (ResNet50 1x)
Object Detection  COCO 2017 val  AP50        69.1   Relation-DETR (ResNet50 1x)
Object Detection  COCO 2017 val  AP75        56.3   Relation-DETR (ResNet50 1x)
Object Detection  COCO 2017 val  APL         66.1   Relation-DETR (ResNet50 1x)
Object Detection  COCO 2017 val  APM         55.6   Relation-DETR (ResNet50 1x)
Object Detection  COCO 2017 val  APS         36.1   Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    AP          45     Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    AP50        53.1   Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    AP75        48.9   Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    APL         62.9   Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    APM         44.4   Relation-DETR (ResNet50 1x)
Object Detection  SA-Det-100k    APS         6      Relation-DETR (ResNet50 1x)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)