


OTA: Optimal Transport Assignment for Object Detection

Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun

Date: 2021-03-26
Venue: CVPR 2021
Task: Object Detection

Abstract

Recent advances in label assignment for object detection mainly seek to define positive/negative training samples independently for each ground-truth (gt) object. In this paper, we revisit label assignment from a global perspective and propose to formulate the assigning procedure as an Optimal Transport (OT) problem, a well-studied topic in optimization theory. Concretely, we define the unit transportation cost between each demander (anchor) and supplier (gt) pair as the weighted sum of their classification and regression losses. Under this formulation, finding the best assignment reduces to solving for the optimal transport plan with minimal transportation cost, which can be done via Sinkhorn-Knopp iteration. On COCO, a single FCOS-ResNet-50 detector equipped with Optimal Transport Assignment (OTA) reaches 40.7% mAP under the 1x training schedule, outperforming all other existing assignment methods. Extensive experiments on COCO and CrowdHuman further validate the effectiveness of OTA, especially in crowded scenes. The code is available at https://github.com/Megvii-BaseDetection/OTA.
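The assignment step described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that); it is a minimal illustration under several assumptions that do not come from the abstract: each gt supplies a fixed number k of positive labels, an extra "background" supplier absorbs the remaining anchors, and the function names, the regularisation strength eps, the weight alpha, and the toy loss values are all hypothetical.

```python
import numpy as np


def sinkhorn_plan(cost, supply, demand, eps=0.1, n_iters=50):
    """Entropy-regularised optimal transport via Sinkhorn-Knopp scaling.

    cost:   (m, n) unit transportation cost from supplier i to demander j
    supply: (m,) units offered by each supplier
    demand: (n,) units needed by each demander (same total as supply)
    """
    kernel = np.exp(-cost / eps)           # Gibbs kernel of the cost matrix
    u = np.ones(len(supply))
    v = np.ones(len(demand))
    for _ in range(n_iters):
        u = supply / (kernel @ v)          # rescale rows towards the supply
        v = demand / (kernel.T @ u)        # rescale columns towards the demand
    return u[:, None] * kernel * v[None, :]


def assign_labels(cls_loss, reg_loss, bg_loss, k=2, alpha=1.5):
    """Assign every anchor to a gt index, or to num_gt for background.

    cls_loss, reg_loss: (num_gt, num_anchors) pairwise losses
    bg_loss:            (num_anchors,) cost of labelling an anchor background
    k, alpha:           illustrative values, not taken from the paper
    """
    num_gt, num_anchors = cls_loss.shape
    assert num_anchors > num_gt * k, "background supply must be positive"
    # Unit cost = weighted sum of classification and regression losses.
    fg_cost = cls_loss + alpha * reg_loss
    cost = np.vstack([fg_cost, bg_loss[None, :]])
    supply = np.full(num_gt + 1, float(k))
    supply[-1] = num_anchors - num_gt * k  # background takes the rest
    demand = np.ones(num_anchors)          # each anchor needs one label
    plan = sinkhorn_plan(cost, supply, demand)
    return plan.argmax(axis=0), plan


# Toy example: 2 gt boxes, 6 anchors, made-up loss values.
cls_loss = np.array([[0.1, 0.2, 2.0, 2.0, 2.0, 2.0],
                     [2.0, 2.0, 2.0, 2.0, 0.2, 0.1]])
reg_loss = np.array([[0.1, 0.1, 1.0, 1.0, 1.0, 1.0],
                     [1.0, 1.0, 1.0, 1.0, 0.1, 0.1]])
bg_loss = np.array([2.0, 2.0, 0.3, 0.3, 2.0, 2.0])

labels, plan = assign_labels(cls_loss, reg_loss, bg_loss)
print(labels.tolist())  # [0, 0, 2, 2, 1, 1]
```

In this toy case anchors 0 and 1 go to the first gt, anchors 4 and 5 to the second, and anchors 2 and 3 to background (index 2). Each anchor is assigned to the supplier that ships it the most mass in the transport plan, so the assignment is decided jointly over all gts rather than one gt at a time.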

Results

Task              Dataset        Metric   Value  Model
Object Detection  COCO test-dev  AP50     68.6   OTA (ResNeXt-101+DCN, multiscale)
Object Detection  COCO test-dev  AP75     57.1   OTA (ResNeXt-101+DCN, multiscale)
Object Detection  COCO test-dev  APL      64.1   OTA (ResNeXt-101+DCN, multiscale)
Object Detection  COCO test-dev  APM      53.7   OTA (ResNeXt-101+DCN, multiscale)
Object Detection  COCO test-dev  APS      34.1   OTA (ResNeXt-101+DCN, multiscale)
Object Detection  COCO test-dev  box mAP  51.5   OTA (ResNeXt-101+DCN, multiscale)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)