Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Scaled-YOLOv4: Scaling Cross Stage Partial Network

Chien-Yao Wang, Alexey Bochkovskiy, Hong-Yuan Mark Liao

2020-11-16 | CVPR 2021 | Tasks: Real-Time Object Detection, 2D Object Detection, Object Detection

Abstract

We show that the YOLOv4 object detection neural network, based on the CSP approach, scales both up and down and is applicable to small and large networks while maintaining optimal speed and accuracy. We propose a network scaling approach that modifies not only the depth, width, and resolution, but also the structure of the network. The YOLOv4-large model achieves state-of-the-art results: 55.5% AP (73.4% AP50) on the MS COCO dataset at a speed of ~16 FPS on a Tesla V100, while with test-time augmentation YOLOv4-large achieves 56.0% AP (73.3% AP50). To the best of our knowledge, this is currently the highest accuracy on the COCO dataset among any published work. The YOLOv4-tiny model achieves 22.0% AP (42.0% AP50) at a speed of 443 FPS on an RTX 2080Ti, while with TensorRT, batch size = 4, and FP16 precision, YOLOv4-tiny achieves 1774 FPS.
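The throughput figures in the abstract are easier to compare as time per frame. The following is a minimal sketch using only the FPS values quoted above; the dictionary labels and the helper name `ms_per_frame` are illustrative, not taken from the paper. Note that the 1774 FPS figure is reported with batch size 4, so its per-frame time is an average over a batch rather than a single-image latency.

```python
# Throughput figures quoted in the abstract (frames per second).
# The labels are illustrative; only the numbers come from the abstract.
fps = {
    "YOLOv4-large, Tesla V100": 16.0,  # quoted as "~16 FPS"
    "YOLOv4-tiny, RTX 2080Ti": 443.0,
    "YOLOv4-tiny, TensorRT FP16, batch 4": 1774.0,
}


def ms_per_frame(frames_per_second: float) -> float:
    """Average time per frame in milliseconds for a given throughput."""
    return 1000.0 / frames_per_second


for name, value in fps.items():
    print(f"{name}: {ms_per_frame(value):.2f} ms per frame")

# Ratio between the two YOLOv4-tiny figures from the abstract.
speedup = fps["YOLOv4-tiny, TensorRT FP16, batch 4"] / fps["YOLOv4-tiny, RTX 2080Ti"]
print(f"TensorRT FP16 (batch 4) vs. 443 FPS baseline: {speedup:.2f}x")
```

Under these numbers, the TensorRT configuration is roughly four times the throughput of the 443 FPS baseline.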

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | COCO test-dev | AP50 | 73.2 | YOLOv4-P7 with TTA |
| Object Detection | COCO test-dev | AP75 | 61.2 | YOLOv4-P7 with TTA |
| Object Detection | COCO test-dev | box mAP | 55.8 | YOLOv4-P7 with TTA |
| Object Detection | COCO test-dev | AP50 | 72.6 | YOLOv4-P6 with TTA |
| Object Detection | COCO test-dev | AP75 | 60.2 | YOLOv4-P6 with TTA |
| Object Detection | COCO test-dev | box mAP | 54.9 | YOLOv4-P6 with TTA |
| Object Detection | COCO test-dev | AP50 | 72.3 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | AP75 | 59.5 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | APL | 65.5 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | APM | 58.2 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | APS | 36.6 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | box mAP | 54.3 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) |
| Object Detection | COCO test-dev | AP50 | 70.3 | YOLOv4-P5 with TTA |
| Object Detection | COCO test-dev | AP75 | 58 | YOLOv4-P5 with TTA |
| Object Detection | COCO test-dev | box mAP | 52.5 | YOLOv4-P5 with TTA |
| Object Detection | COCO test-dev | AP50 | 64.1 | YOLOv4 (CD53) |
| Object Detection | COCO test-dev | AP75 | 49.5 | YOLOv4 (CD53) |
| Object Detection | COCO test-dev | APL | 56.7 | YOLOv4 (CD53) |
| Object Detection | COCO test-dev | APM | 49 | YOLOv4 (CD53) |
| Object Detection | COCO test-dev | APS | 27 | YOLOv4 (CD53) |
| Object Detection | COCO test-dev | box mAP | 45.5 | YOLOv4 (CD53) |
| Object Detection | COCO minival | AP50 | 73.3 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
| Object Detection | COCO minival | AP75 | 60.7 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
| Object Detection | COCO minival | APL | 67.4 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
| Object Detection | COCO minival | APM | 59.5 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
| Object Detection | COCO minival | APS | 38.1 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
| Object Detection | COCO minival | box AP | 55.4 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) |
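The metric names in the results follow the standard COCO convention: AP50 and AP75 are average precision at intersection-over-union (IoU) thresholds of 0.50 and 0.75, and APS, APM, and APL restrict evaluation to small, medium, and large objects. As a minimal sketch of what the IoU threshold means for a single prediction, the snippet below computes IoU for two boxes; the function name, the `(x1, y1, x2, y2)` box format, and the example coordinates are illustrative assumptions, not values from the paper or from any evaluation toolkit.

```python
def iou(box_a, box_b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2) corners."""
    # Corners of the intersection rectangle (may be empty).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


# Hypothetical boxes: the prediction is shifted 20 px right of the ground truth.
ground_truth = (0.0, 0.0, 100.0, 100.0)
prediction = (20.0, 0.0, 120.0, 100.0)

overlap = iou(ground_truth, prediction)
print(f"IoU = {overlap:.3f}")
print("counts as a match at IoU 0.50:", overlap >= 0.50)
print("counts as a match at IoU 0.75:", overlap >= 0.75)
```

In this example the prediction would count as a correct detection for AP50 but not for AP75, which is why AP75 values in the results are consistently lower than AP50 for the same model.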

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)