TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dynamic Head: Unifying Object Detection Heads with Attenti...

Dynamic Head: Unifying Object Detection Heads with Attentions

Xiyang Dai, Yinpeng Chen, Bin Xiao, Dongdong Chen, Mengchen Liu, Lu Yuan, Lei Zhang

2021-06-15CVPR 2021 1object-detectionObject Detection
PaperPDFCode(official)CodeCode

Abstract

The complex nature of combining localization and classification in object detection has resulted in the flourished development of methods. Previous works tried to improve the performance in various object detection heads but failed to present a unified view. In this paper, we present a novel dynamic head framework to unify object detection heads with attentions. By coherently combining multiple self-attention mechanisms between feature levels for scale-awareness, among spatial locations for spatial-awareness, and within output channels for task-awareness, the proposed approach significantly improves the representation ability of object detection heads without any computational overhead. Further experiments demonstrate that the effectiveness and efficiency of the proposed dynamic head on the COCO benchmark. With a standard ResNeXt-101-DCN backbone, we largely improve the performance over popular object detectors and achieve a new state-of-the-art at 54.0 AP. Furthermore, with latest transformer backbone and extra data, we can push current best COCO result to a new record at 60.6 AP. The code will be released at https://github.com/microsoft/DynamicHead.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5078.5DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO test-devAP7566.6DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO test-devAPL74.2DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO test-devAPM64DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO test-devbox mAP60.6DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO test-devAP5077.1DyHead (Swin-L, multi scale)
Object DetectionCOCO test-devAP7564.5DyHead (Swin-L, multi scale)
Object DetectionCOCO test-devAPL72.8DyHead (Swin-L, multi scale)
Object DetectionCOCO test-devAPM62DyHead (Swin-L, multi scale)
Object DetectionCOCO test-devbox mAP58.7DyHead (Swin-L, multi scale)
Object DetectionCOCO test-devAP5072.1DyHead (ResNeXt-64x4d-101-DCN, multi scale)
Object DetectionCOCO test-devAP7559.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
Object DetectionCOCO test-devbox mAP54DyHead (ResNeXt-64x4d-101-DCN, multi scale)
Object DetectionCOCO test-devAP5065.7DyHead (ResNeXt-64x4d-101)
Object DetectionCOCO test-devAP7551.9DyHead (ResNeXt-64x4d-101)
Object DetectionCOCO test-devbox mAP47.7DyHead (ResNeXt-64x4d-101)
Object DetectionCOCO test-devAP5060.7DyHead (ResNet-50)
Object DetectionCOCO test-devAP7546.8DyHead (ResNet-50)
Object DetectionCOCO test-devbox mAP43DyHead (ResNet-50)
Object DetectionCOCO-OAverage mAP35.3DyHead (Swin-L)
Object DetectionCOCO-OEffective Robustness10DyHead (Swin-L)
Object DetectionCOCO-OAverage mAP19.3DyHead (ResNet-50)
Object DetectionCOCO-OEffective Robustness0.16DyHead (ResNet-50)
Object DetectionCOCO 2017 valAP5068DyHead (Swin-T, multi scale)
Object DetectionCOCO 2017 valAP7554.3DyHead (Swin-T, multi scale)
Object DetectionCOCO 2017 valAPL64.2DyHead (Swin-T, multi scale)
Object DetectionCOCO minivalAP5078.2DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO minivalAPL74.2DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO minivalbox AP60.3DyHead (Swin-L, multi scale, self-training)
Object DetectionCOCO minivalAP5076.8DyHead (Swin-L, multi scale)
Object DetectionCOCO minivalAPL73.2DyHead (Swin-L, multi scale)
Object DetectionCOCO minivalAPM62.2DyHead (Swin-L, multi scale)
Object DetectionCOCO minivalAPS44.5DyHead (Swin-L, multi scale)
Object DetectionCOCO minivalbox AP58.4DyHead (Swin-L, multi scale)
Object DetectionCOCO minivalbox AP46.5DyHead (ResNet-101)
Object DetectionCOCO minivalAPL66.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
3DCOCO test-devAP5078.5DyHead (Swin-L, multi scale, self-training)
3DCOCO test-devAP7566.6DyHead (Swin-L, multi scale, self-training)
3DCOCO test-devAPL74.2DyHead (Swin-L, multi scale, self-training)
3DCOCO test-devAPM64DyHead (Swin-L, multi scale, self-training)
3DCOCO test-devbox mAP60.6DyHead (Swin-L, multi scale, self-training)
3DCOCO test-devAP5077.1DyHead (Swin-L, multi scale)
3DCOCO test-devAP7564.5DyHead (Swin-L, multi scale)
3DCOCO test-devAPL72.8DyHead (Swin-L, multi scale)
3DCOCO test-devAPM62DyHead (Swin-L, multi scale)
3DCOCO test-devbox mAP58.7DyHead (Swin-L, multi scale)
3DCOCO test-devAP5072.1DyHead (ResNeXt-64x4d-101-DCN, multi scale)
3DCOCO test-devAP7559.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
3DCOCO test-devbox mAP54DyHead (ResNeXt-64x4d-101-DCN, multi scale)
3DCOCO test-devAP5065.7DyHead (ResNeXt-64x4d-101)
3DCOCO test-devAP7551.9DyHead (ResNeXt-64x4d-101)
3DCOCO test-devbox mAP47.7DyHead (ResNeXt-64x4d-101)
3DCOCO test-devAP5060.7DyHead (ResNet-50)
3DCOCO test-devAP7546.8DyHead (ResNet-50)
3DCOCO test-devbox mAP43DyHead (ResNet-50)
3DCOCO-OAverage mAP35.3DyHead (Swin-L)
3DCOCO-OEffective Robustness10DyHead (Swin-L)
3DCOCO-OAverage mAP19.3DyHead (ResNet-50)
3DCOCO-OEffective Robustness0.16DyHead (ResNet-50)
3DCOCO 2017 valAP5068DyHead (Swin-T, multi scale)
3DCOCO 2017 valAP7554.3DyHead (Swin-T, multi scale)
3DCOCO 2017 valAPL64.2DyHead (Swin-T, multi scale)
3DCOCO minivalAP5078.2DyHead (Swin-L, multi scale, self-training)
3DCOCO minivalAPL74.2DyHead (Swin-L, multi scale, self-training)
3DCOCO minivalbox AP60.3DyHead (Swin-L, multi scale, self-training)
3DCOCO minivalAP5076.8DyHead (Swin-L, multi scale)
3DCOCO minivalAPL73.2DyHead (Swin-L, multi scale)
3DCOCO minivalAPM62.2DyHead (Swin-L, multi scale)
3DCOCO minivalAPS44.5DyHead (Swin-L, multi scale)
3DCOCO minivalbox AP58.4DyHead (Swin-L, multi scale)
3DCOCO minivalbox AP46.5DyHead (ResNet-101)
3DCOCO minivalAPL66.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D ClassificationCOCO test-devAP5078.5DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO test-devAP7566.6DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO test-devAPL74.2DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO test-devAPM64DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO test-devbox mAP60.6DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO test-devAP5077.1DyHead (Swin-L, multi scale)
2D ClassificationCOCO test-devAP7564.5DyHead (Swin-L, multi scale)
2D ClassificationCOCO test-devAPL72.8DyHead (Swin-L, multi scale)
2D ClassificationCOCO test-devAPM62DyHead (Swin-L, multi scale)
2D ClassificationCOCO test-devbox mAP58.7DyHead (Swin-L, multi scale)
2D ClassificationCOCO test-devAP5072.1DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D ClassificationCOCO test-devAP7559.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D ClassificationCOCO test-devbox mAP54DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D ClassificationCOCO test-devAP5065.7DyHead (ResNeXt-64x4d-101)
2D ClassificationCOCO test-devAP7551.9DyHead (ResNeXt-64x4d-101)
2D ClassificationCOCO test-devbox mAP47.7DyHead (ResNeXt-64x4d-101)
2D ClassificationCOCO test-devAP5060.7DyHead (ResNet-50)
2D ClassificationCOCO test-devAP7546.8DyHead (ResNet-50)
2D ClassificationCOCO test-devbox mAP43DyHead (ResNet-50)
2D ClassificationCOCO-OAverage mAP35.3DyHead (Swin-L)
2D ClassificationCOCO-OEffective Robustness10DyHead (Swin-L)
2D ClassificationCOCO-OAverage mAP19.3DyHead (ResNet-50)
2D ClassificationCOCO-OEffective Robustness0.16DyHead (ResNet-50)
2D ClassificationCOCO 2017 valAP5068DyHead (Swin-T, multi scale)
2D ClassificationCOCO 2017 valAP7554.3DyHead (Swin-T, multi scale)
2D ClassificationCOCO 2017 valAPL64.2DyHead (Swin-T, multi scale)
2D ClassificationCOCO minivalAP5078.2DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO minivalAPL74.2DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO minivalbox AP60.3DyHead (Swin-L, multi scale, self-training)
2D ClassificationCOCO minivalAP5076.8DyHead (Swin-L, multi scale)
2D ClassificationCOCO minivalAPL73.2DyHead (Swin-L, multi scale)
2D ClassificationCOCO minivalAPM62.2DyHead (Swin-L, multi scale)
2D ClassificationCOCO minivalAPS44.5DyHead (Swin-L, multi scale)
2D ClassificationCOCO minivalbox AP58.4DyHead (Swin-L, multi scale)
2D ClassificationCOCO minivalbox AP46.5DyHead (ResNet-101)
2D ClassificationCOCO minivalAPL66.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D Object DetectionCOCO test-devAP5078.5DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO test-devAP7566.6DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO test-devAPL74.2DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO test-devAPM64DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO test-devbox mAP60.6DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO test-devAP5077.1DyHead (Swin-L, multi scale)
2D Object DetectionCOCO test-devAP7564.5DyHead (Swin-L, multi scale)
2D Object DetectionCOCO test-devAPL72.8DyHead (Swin-L, multi scale)
2D Object DetectionCOCO test-devAPM62DyHead (Swin-L, multi scale)
2D Object DetectionCOCO test-devbox mAP58.7DyHead (Swin-L, multi scale)
2D Object DetectionCOCO test-devAP5072.1DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D Object DetectionCOCO test-devAP7559.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D Object DetectionCOCO test-devbox mAP54DyHead (ResNeXt-64x4d-101-DCN, multi scale)
2D Object DetectionCOCO test-devAP5065.7DyHead (ResNeXt-64x4d-101)
2D Object DetectionCOCO test-devAP7551.9DyHead (ResNeXt-64x4d-101)
2D Object DetectionCOCO test-devbox mAP47.7DyHead (ResNeXt-64x4d-101)
2D Object DetectionCOCO test-devAP5060.7DyHead (ResNet-50)
2D Object DetectionCOCO test-devAP7546.8DyHead (ResNet-50)
2D Object DetectionCOCO test-devbox mAP43DyHead (ResNet-50)
2D Object DetectionCOCO-OAverage mAP35.3DyHead (Swin-L)
2D Object DetectionCOCO-OEffective Robustness10DyHead (Swin-L)
2D Object DetectionCOCO-OAverage mAP19.3DyHead (ResNet-50)
2D Object DetectionCOCO-OEffective Robustness0.16DyHead (ResNet-50)
2D Object DetectionCOCO 2017 valAP5068DyHead (Swin-T, multi scale)
2D Object DetectionCOCO 2017 valAP7554.3DyHead (Swin-T, multi scale)
2D Object DetectionCOCO 2017 valAPL64.2DyHead (Swin-T, multi scale)
2D Object DetectionCOCO minivalAP5078.2DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO minivalAPL74.2DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO minivalbox AP60.3DyHead (Swin-L, multi scale, self-training)
2D Object DetectionCOCO minivalAP5076.8DyHead (Swin-L, multi scale)
2D Object DetectionCOCO minivalAPL73.2DyHead (Swin-L, multi scale)
2D Object DetectionCOCO minivalAPM62.2DyHead (Swin-L, multi scale)
2D Object DetectionCOCO minivalAPS44.5DyHead (Swin-L, multi scale)
2D Object DetectionCOCO minivalbox AP58.4DyHead (Swin-L, multi scale)
2D Object DetectionCOCO minivalbox AP46.5DyHead (ResNet-101)
2D Object DetectionCOCO minivalAPL66.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
16kCOCO test-devAP5078.5DyHead (Swin-L, multi scale, self-training)
16kCOCO test-devAP7566.6DyHead (Swin-L, multi scale, self-training)
16kCOCO test-devAPL74.2DyHead (Swin-L, multi scale, self-training)
16kCOCO test-devAPM64DyHead (Swin-L, multi scale, self-training)
16kCOCO test-devbox mAP60.6DyHead (Swin-L, multi scale, self-training)
16kCOCO test-devAP5077.1DyHead (Swin-L, multi scale)
16kCOCO test-devAP7564.5DyHead (Swin-L, multi scale)
16kCOCO test-devAPL72.8DyHead (Swin-L, multi scale)
16kCOCO test-devAPM62DyHead (Swin-L, multi scale)
16kCOCO test-devbox mAP58.7DyHead (Swin-L, multi scale)
16kCOCO test-devAP5072.1DyHead (ResNeXt-64x4d-101-DCN, multi scale)
16kCOCO test-devAP7559.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)
16kCOCO test-devbox mAP54DyHead (ResNeXt-64x4d-101-DCN, multi scale)
16kCOCO test-devAP5065.7DyHead (ResNeXt-64x4d-101)
16kCOCO test-devAP7551.9DyHead (ResNeXt-64x4d-101)
16kCOCO test-devbox mAP47.7DyHead (ResNeXt-64x4d-101)
16kCOCO test-devAP5060.7DyHead (ResNet-50)
16kCOCO test-devAP7546.8DyHead (ResNet-50)
16kCOCO test-devbox mAP43DyHead (ResNet-50)
16kCOCO-OAverage mAP35.3DyHead (Swin-L)
16kCOCO-OEffective Robustness10DyHead (Swin-L)
16kCOCO-OAverage mAP19.3DyHead (ResNet-50)
16kCOCO-OEffective Robustness0.16DyHead (ResNet-50)
16kCOCO 2017 valAP5068DyHead (Swin-T, multi scale)
16kCOCO 2017 valAP7554.3DyHead (Swin-T, multi scale)
16kCOCO 2017 valAPL64.2DyHead (Swin-T, multi scale)
16kCOCO minivalAP5078.2DyHead (Swin-L, multi scale, self-training)
16kCOCO minivalAPL74.2DyHead (Swin-L, multi scale, self-training)
16kCOCO minivalbox AP60.3DyHead (Swin-L, multi scale, self-training)
16kCOCO minivalAP5076.8DyHead (Swin-L, multi scale)
16kCOCO minivalAPL73.2DyHead (Swin-L, multi scale)
16kCOCO minivalAPM62.2DyHead (Swin-L, multi scale)
16kCOCO minivalAPS44.5DyHead (Swin-L, multi scale)
16kCOCO minivalbox AP58.4DyHead (Swin-L, multi scale)
16kCOCO minivalbox AP46.5DyHead (ResNet-101)
16kCOCO minivalAPL66.3DyHead (ResNeXt-64x4d-101-DCN, multi scale)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07