TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLOv4: Optimal Speed and Accuracy of Object Detection

YOLOv4: Optimal Speed and Accuracy of Object Detection

Alexey Bochkovskiy, Chien-Yao Wang, Hong-Yuan Mark Liao

2020-04-23Data AugmentationReal-Time Object DetectionBIG-bench Machine LearningObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

There are a huge number of features which are said to improve Convolutional Neural Network (CNN) accuracy. Practical testing of combinations of such features on large datasets, and theoretical justification of the result, is required. Some features operate on certain models exclusively and for certain problems exclusively, or only for small-scale datasets; while some features, such as batch-normalization and residual-connections, are applicable to the majority of models, tasks, and datasets. We assume that such universal features include Weighted-Residual-Connections (WRC), Cross-Stage-Partial-connections (CSP), Cross mini-Batch Normalization (CmBN), Self-adversarial-training (SAT) and Mish-activation. We use new features: WRC, CSP, CmBN, SAT, Mish activation, Mosaic data augmentation, CmBN, DropBlock regularization, and CIoU loss, and combine some of them to achieve state-of-the-art results: 43.5% AP (65.7% AP50) for the MS COCO dataset at a realtime speed of ~65 FPS on Tesla V100. Source code is at https://github.com/AlexeyAB/darknet

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5065.7YOLOv4-608
Object DetectionCOCO test-devAP7547.3YOLOv4-608
Object DetectionCOCO test-devAPL53.3YOLOv4-608
Object DetectionCOCO test-devAPM46.7YOLOv4-608
Object DetectionCOCO test-devAPS26.7YOLOv4-608
Object DetectionCOCO test-devbox mAP43.5YOLOv4-608
Object DetectionCOCO-OAverage mAP30.4YOLOv4-P6
Object DetectionCOCO-OEffective Robustness5.89YOLOv4-P6
Object DetectionPKU-DDD17-Car mAP5081.3YOLOv4
Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)23YOLOv4-L
Object DetectionCOCO (Common Objects in Context)box AP43.5YOLOv4-L
Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)31YOLOv4-M
Object DetectionCOCO (Common Objects in Context)box AP43YOLOv4-M
Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)38YOLOv4-S
Object DetectionCOCO (Common Objects in Context)box AP41.2YOLOv4-S
3DCOCO test-devAP5065.7YOLOv4-608
3DCOCO test-devAP7547.3YOLOv4-608
3DCOCO test-devAPL53.3YOLOv4-608
3DCOCO test-devAPM46.7YOLOv4-608
3DCOCO test-devAPS26.7YOLOv4-608
3DCOCO test-devbox mAP43.5YOLOv4-608
3DCOCO-OAverage mAP30.4YOLOv4-P6
3DCOCO-OEffective Robustness5.89YOLOv4-P6
3DPKU-DDD17-Car mAP5081.3YOLOv4
3DCOCO (Common Objects in Context)FPS (V100, b=1)23YOLOv4-L
3DCOCO (Common Objects in Context)box AP43.5YOLOv4-L
3DCOCO (Common Objects in Context)FPS (V100, b=1)31YOLOv4-M
3DCOCO (Common Objects in Context)box AP43YOLOv4-M
3DCOCO (Common Objects in Context)FPS (V100, b=1)38YOLOv4-S
3DCOCO (Common Objects in Context)box AP41.2YOLOv4-S
2D ClassificationCOCO test-devAP5065.7YOLOv4-608
2D ClassificationCOCO test-devAP7547.3YOLOv4-608
2D ClassificationCOCO test-devAPL53.3YOLOv4-608
2D ClassificationCOCO test-devAPM46.7YOLOv4-608
2D ClassificationCOCO test-devAPS26.7YOLOv4-608
2D ClassificationCOCO test-devbox mAP43.5YOLOv4-608
2D ClassificationCOCO-OAverage mAP30.4YOLOv4-P6
2D ClassificationCOCO-OEffective Robustness5.89YOLOv4-P6
2D ClassificationPKU-DDD17-Car mAP5081.3YOLOv4
2D ClassificationCOCO (Common Objects in Context)FPS (V100, b=1)23YOLOv4-L
2D ClassificationCOCO (Common Objects in Context)box AP43.5YOLOv4-L
2D ClassificationCOCO (Common Objects in Context)FPS (V100, b=1)31YOLOv4-M
2D ClassificationCOCO (Common Objects in Context)box AP43YOLOv4-M
2D ClassificationCOCO (Common Objects in Context)FPS (V100, b=1)38YOLOv4-S
2D ClassificationCOCO (Common Objects in Context)box AP41.2YOLOv4-S
2D Object DetectionCOCO test-devAP5065.7YOLOv4-608
2D Object DetectionCOCO test-devAP7547.3YOLOv4-608
2D Object DetectionCOCO test-devAPL53.3YOLOv4-608
2D Object DetectionCOCO test-devAPM46.7YOLOv4-608
2D Object DetectionCOCO test-devAPS26.7YOLOv4-608
2D Object DetectionCOCO test-devbox mAP43.5YOLOv4-608
2D Object DetectionCOCO-OAverage mAP30.4YOLOv4-P6
2D Object DetectionCOCO-OEffective Robustness5.89YOLOv4-P6
2D Object DetectionPKU-DDD17-Car mAP5081.3YOLOv4
2D Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)23YOLOv4-L
2D Object DetectionCOCO (Common Objects in Context)box AP43.5YOLOv4-L
2D Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)31YOLOv4-M
2D Object DetectionCOCO (Common Objects in Context)box AP43YOLOv4-M
2D Object DetectionCOCO (Common Objects in Context)FPS (V100, b=1)38YOLOv4-S
2D Object DetectionCOCO (Common Objects in Context)box AP41.2YOLOv4-S
16kCOCO test-devAP5065.7YOLOv4-608
16kCOCO test-devAP7547.3YOLOv4-608
16kCOCO test-devAPL53.3YOLOv4-608
16kCOCO test-devAPM46.7YOLOv4-608
16kCOCO test-devAPS26.7YOLOv4-608
16kCOCO test-devbox mAP43.5YOLOv4-608
16kCOCO-OAverage mAP30.4YOLOv4-P6
16kCOCO-OEffective Robustness5.89YOLOv4-P6
16kPKU-DDD17-Car mAP5081.3YOLOv4
16kCOCO (Common Objects in Context)FPS (V100, b=1)23YOLOv4-L
16kCOCO (Common Objects in Context)box AP43.5YOLOv4-L
16kCOCO (Common Objects in Context)FPS (V100, b=1)31YOLOv4-M
16kCOCO (Common Objects in Context)box AP43YOLOv4-M
16kCOCO (Common Objects in Context)FPS (V100, b=1)38YOLOv4-S
16kCOCO (Common Objects in Context)box AP41.2YOLOv4-S

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16