TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TOOD: Task-aligned One-stage Object Detection

TOOD: Task-aligned One-stage Object Detection

Chengjian Feng, Yujie Zhong, Yu Gao, Matthew R. Scott, Weilin Huang

2021-08-17ICCV 2021 102D Object Detectionobject-detectionObject Detection
PaperPDFCode(official)CodeCodeCodeCodeCode

Abstract

One-stage object detection is commonly implemented by optimizing two sub-tasks: object classification and localization, using heads with two parallel branches, which might lead to a certain level of spatial misalignment in predictions between the two tasks. In this work, we propose a Task-aligned One-stage Object Detection (TOOD) that explicitly aligns the two tasks in a learning-based manner. First, we design a novel Task-aligned Head (T-Head) which offers a better balance between learning task-interactive and task-specific features, as well as a greater flexibility to learn the alignment via a task-aligned predictor. Second, we propose Task Alignment Learning (TAL) to explicitly pull closer (or even unify) the optimal anchors for the two tasks during training via a designed sample assignment scheme and a task-aligned loss. Extensive experiments are conducted on MS-COCO, where TOOD achieves a 51.1 AP at single-model single-scale testing. This surpasses the recent one-stage detectors by a large margin, such as ATSS (47.7 AP), GFL (48.2 AP), and PAA (49.0 AP), with fewer parameters and FLOPs. Qualitative results also demonstrate the effectiveness of TOOD for better aligning the tasks of object classification and localization. Code is available at https://github.com/fcjian/TOOD.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5060.3TAL + TAP
Object DetectionCOCO test-devAP7546.4TAL + TAP
Object DetectionCOCO test-devbox mAP42.5TAL + TAP
3DCOCO test-devAP5060.3TAL + TAP
3DCOCO test-devAP7546.4TAL + TAP
3DCOCO test-devbox mAP42.5TAL + TAP
2D ClassificationCOCO test-devAP5060.3TAL + TAP
2D ClassificationCOCO test-devAP7546.4TAL + TAP
2D ClassificationCOCO test-devbox mAP42.5TAL + TAP
2D Object DetectionCeyMomAP65.6TOOD
2D Object DetectionCOCO test-devAP5060.3TAL + TAP
2D Object DetectionCOCO test-devAP7546.4TAL + TAP
2D Object DetectionCOCO test-devbox mAP42.5TAL + TAP
16kCOCO test-devAP5060.3TAL + TAP
16kCOCO test-devAP7546.4TAL + TAP
16kCOCO test-devbox mAP42.5TAL + TAP

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07