TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLOX: Exceeding YOLO Series in 2021

YOLOX: Exceeding YOLO Series in 2021

Zheng Ge, Songtao Liu, Feng Wang, Zeming Li, Jian Sun

2021-07-18Real-Time Object DetectionAutonomous Driving2D Object DetectionObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

In this report, we present some experienced improvements to YOLO series, forming a new high-performance detector -- YOLOX. We switch the YOLO detector to an anchor-free manner and conduct other advanced detection techniques, i.e., a decoupled head and the leading label assignment strategy SimOTA to achieve state-of-the-art results across a large scale range of models: For YOLO-Nano with only 0.91M parameters and 1.08G FLOPs, we get 25.3% AP on COCO, surpassing NanoDet by 1.8% AP; for YOLOv3, one of the most widely used detectors in industry, we boost it to 47.3% AP on COCO, outperforming the current best practice by 3.0% AP; for YOLOX-L with roughly the same amount of parameters as YOLOv4-CSP, YOLOv5-L, we achieve 50.0% AP on COCO at a speed of 68.9 FPS on Tesla V100, exceeding YOLOv5-L by 1.8% AP. Further, we won the 1st Place on Streaming Perception Challenge (Workshop on Autonomous Driving at CVPR 2021) using a single YOLOX-L model. We hope this report can provide useful experience for developers and researchers in practical scenes, and we also provide deploy versions with ONNX, TensorRT, NCNN, and Openvino supported. Source code is at https://github.com/Megvii-BaseDetection/YOLOX.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devbox mAP51.5YOLOX-x(Modified CSP v5, 640x640, single-scale)
Object DetectionCOCO test-devAP5069.6YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devAP7555.7YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devAPL66.1YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devAPM56.1YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devAPS31.2YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devParams (M)99.1YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devbox mAP51.2YOLOX-X (Modified CSP v5)
Object DetectionCOCO test-devbox mAP48YOLOX-Darknet53(Darknet53, 640x640, single-scale)
Object DetectionCOCO-OAverage mAP30.3YOLOX-X
Object DetectionCOCO-OEffective Robustness7.26YOLOX-X
Object DetectionCOCO-OAverage mAP20.6YOLOX-S
Object DetectionCOCO-OEffective Robustness2.48YOLOX-S
Object DetectionWaterScenesmAP@50-9557.8YOLOX-M
Object DetectionArgoverse-HD (Full-Stack, Test)AP41.1YOLOX
Object DetectionArgoverse-HD (Detection-Only, Test)AP41.1YOLOX
Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)62.5YOLOv5-X
Object DetectionCOCO (Common Objects in Context)box AP50.4YOLOv5-X
Object DetectionArgoverse-HD (Detection-Only, Val)AP47.42YOLOX
3DCOCO test-devbox mAP51.5YOLOX-x(Modified CSP v5, 640x640, single-scale)
3DCOCO test-devAP5069.6YOLOX-X (Modified CSP v5)
3DCOCO test-devAP7555.7YOLOX-X (Modified CSP v5)
3DCOCO test-devAPL66.1YOLOX-X (Modified CSP v5)
3DCOCO test-devAPM56.1YOLOX-X (Modified CSP v5)
3DCOCO test-devAPS31.2YOLOX-X (Modified CSP v5)
3DCOCO test-devParams (M)99.1YOLOX-X (Modified CSP v5)
3DCOCO test-devbox mAP51.2YOLOX-X (Modified CSP v5)
3DCOCO test-devbox mAP48YOLOX-Darknet53(Darknet53, 640x640, single-scale)
3DCOCO-OAverage mAP30.3YOLOX-X
3DCOCO-OEffective Robustness7.26YOLOX-X
3DCOCO-OAverage mAP20.6YOLOX-S
3DCOCO-OEffective Robustness2.48YOLOX-S
3DWaterScenesmAP@50-9557.8YOLOX-M
3DArgoverse-HD (Full-Stack, Test)AP41.1YOLOX
3DArgoverse-HD (Detection-Only, Test)AP41.1YOLOX
3DCOCO (Common Objects in Context)FPS (V100, b=1)62.5YOLOv5-X
3DCOCO (Common Objects in Context)box AP50.4YOLOv5-X
3DArgoverse-HD (Detection-Only, Val)AP47.42YOLOX
2D ClassificationCOCO test-devbox mAP51.5YOLOX-x(Modified CSP v5, 640x640, single-scale)
2D ClassificationCOCO test-devAP5069.6YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devAP7555.7YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devAPL66.1YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devAPM56.1YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devAPS31.2YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devParams (M)99.1YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devbox mAP51.2YOLOX-X (Modified CSP v5)
2D ClassificationCOCO test-devbox mAP48YOLOX-Darknet53(Darknet53, 640x640, single-scale)
2D ClassificationCOCO-OAverage mAP30.3YOLOX-X
2D ClassificationCOCO-OEffective Robustness7.26YOLOX-X
2D ClassificationCOCO-OAverage mAP20.6YOLOX-S
2D ClassificationCOCO-OEffective Robustness2.48YOLOX-S
2D ClassificationWaterScenesmAP@50-9557.8YOLOX-M
2D ClassificationArgoverse-HD (Full-Stack, Test)AP41.1YOLOX
2D ClassificationArgoverse-HD (Detection-Only, Test)AP41.1YOLOX
2D ClassificationCOCO (Common Objects in Context)FPS (V100, b=1)62.5YOLOv5-X
2D ClassificationCOCO (Common Objects in Context)box AP50.4YOLOv5-X
2D ClassificationArgoverse-HD (Detection-Only, Val)AP47.42YOLOX
2D Object DetectionCeyMomAP57.7YOLOX
2D Object DetectionCOCO test-devbox mAP51.5YOLOX-x(Modified CSP v5, 640x640, single-scale)
2D Object DetectionCOCO test-devAP5069.6YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devAP7555.7YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devAPL66.1YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devAPM56.1YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devAPS31.2YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devParams (M)99.1YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devbox mAP51.2YOLOX-X (Modified CSP v5)
2D Object DetectionCOCO test-devbox mAP48YOLOX-Darknet53(Darknet53, 640x640, single-scale)
2D Object DetectionCOCO-OAverage mAP30.3YOLOX-X
2D Object DetectionCOCO-OEffective Robustness7.26YOLOX-X
2D Object DetectionCOCO-OAverage mAP20.6YOLOX-S
2D Object DetectionCOCO-OEffective Robustness2.48YOLOX-S
2D Object DetectionWaterScenesmAP@50-9557.8YOLOX-M
2D Object DetectionArgoverse-HD (Full-Stack, Test)AP41.1YOLOX
2D Object DetectionArgoverse-HD (Detection-Only, Test)AP41.1YOLOX
2D Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)62.5YOLOv5-X
2D Object DetectionCOCO (Common Objects in Context)box AP50.4YOLOv5-X
2D Object DetectionArgoverse-HD (Detection-Only, Val)AP47.42YOLOX
16kCOCO test-devbox mAP51.5YOLOX-x(Modified CSP v5, 640x640, single-scale)
16kCOCO test-devAP5069.6YOLOX-X (Modified CSP v5)
16kCOCO test-devAP7555.7YOLOX-X (Modified CSP v5)
16kCOCO test-devAPL66.1YOLOX-X (Modified CSP v5)
16kCOCO test-devAPM56.1YOLOX-X (Modified CSP v5)
16kCOCO test-devAPS31.2YOLOX-X (Modified CSP v5)
16kCOCO test-devParams (M)99.1YOLOX-X (Modified CSP v5)
16kCOCO test-devbox mAP51.2YOLOX-X (Modified CSP v5)
16kCOCO test-devbox mAP48YOLOX-Darknet53(Darknet53, 640x640, single-scale)
16kCOCO-OAverage mAP30.3YOLOX-X
16kCOCO-OEffective Robustness7.26YOLOX-X
16kCOCO-OAverage mAP20.6YOLOX-S
16kCOCO-OEffective Robustness2.48YOLOX-S
16kWaterScenesmAP@50-9557.8YOLOX-M
16kArgoverse-HD (Full-Stack, Test)AP41.1YOLOX
16kArgoverse-HD (Detection-Only, Test)AP41.1YOLOX
16kCOCO (Common Objects in Context)FPS (V100, b=1)62.5YOLOv5-X
16kCOCO (Common Objects in Context)box AP50.4YOLOv5-X
16kArgoverse-HD (Detection-Only, Val)AP47.42YOLOX

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17