YOLOv4: Optimal Speed and Accuracy of Object Detection

Alexey Bochkovskiy, Chien-Yao Wang, Hong-Yuan Mark Liao

2020-04-23Data Augmentation Real-Time Object Detection BIG-bench Machine Learning Object Detection

Abstract

There are a huge number of features which are said to improve Convolutional Neural Network (CNN) accuracy. Practical testing of combinations of such features on large datasets, and theoretical justification of the result, is required. Some features operate on certain models exclusively and for certain problems exclusively, or only for small-scale datasets; while some features, such as batch-normalization and residual-connections, are applicable to the majority of models, tasks, and datasets. We assume that such universal features include Weighted-Residual-Connections (WRC), Cross-Stage-Partial-connections (CSP), Cross mini-Batch Normalization (CmBN), Self-adversarial-training (SAT) and Mish-activation. We use new features: WRC, CSP, CmBN, SAT, Mish activation, Mosaic data augmentation, CmBN, DropBlock regularization, and CIoU loss, and combine some of them to achieve state-of-the-art results: 43.5% AP (65.7% AP50) for the MS COCO dataset at a realtime speed of ~65 FPS on Tesla V100. Source code is at https://github.com/AlexeyAB/darknet

Results

Task	Dataset	Metric	Value	Model
Object Detection	COCO test-dev	AP50	65.7	YOLOv4-608
Object Detection	COCO test-dev	AP75	47.3	YOLOv4-608
Object Detection	COCO test-dev	APL	53.3	YOLOv4-608
Object Detection	COCO test-dev	APM	46.7	YOLOv4-608
Object Detection	COCO test-dev	APS	26.7	YOLOv4-608
Object Detection	COCO test-dev	box mAP	43.5	YOLOv4-608
Object Detection	COCO-O	Average mAP	30.4	YOLOv4-P6
Object Detection	COCO-O	Effective Robustness	5.89	YOLOv4-P6
Object Detection	PKU-DDD17-Car	mAP50	81.3	YOLOv4
Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	23	YOLOv4-L
Object Detection	COCO (Common Objects in Context)	box AP	43.5	YOLOv4-L
Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	31	YOLOv4-M
Object Detection	COCO (Common Objects in Context)	box AP	43	YOLOv4-M
Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	38	YOLOv4-S
Object Detection	COCO (Common Objects in Context)	box AP	41.2	YOLOv4-S
3D	COCO test-dev	AP50	65.7	YOLOv4-608
3D	COCO test-dev	AP75	47.3	YOLOv4-608
3D	COCO test-dev	APL	53.3	YOLOv4-608
3D	COCO test-dev	APM	46.7	YOLOv4-608
3D	COCO test-dev	APS	26.7	YOLOv4-608
3D	COCO test-dev	box mAP	43.5	YOLOv4-608
3D	COCO-O	Average mAP	30.4	YOLOv4-P6
3D	COCO-O	Effective Robustness	5.89	YOLOv4-P6
3D	PKU-DDD17-Car	mAP50	81.3	YOLOv4
3D	COCO (Common Objects in Context)	FPS (V100, b=1)	23	YOLOv4-L
3D	COCO (Common Objects in Context)	box AP	43.5	YOLOv4-L
3D	COCO (Common Objects in Context)	FPS (V100, b=1)	31	YOLOv4-M
3D	COCO (Common Objects in Context)	box AP	43	YOLOv4-M
3D	COCO (Common Objects in Context)	FPS (V100, b=1)	38	YOLOv4-S
3D	COCO (Common Objects in Context)	box AP	41.2	YOLOv4-S
2D Classification	COCO test-dev	AP50	65.7	YOLOv4-608
2D Classification	COCO test-dev	AP75	47.3	YOLOv4-608
2D Classification	COCO test-dev	APL	53.3	YOLOv4-608
2D Classification	COCO test-dev	APM	46.7	YOLOv4-608
2D Classification	COCO test-dev	APS	26.7	YOLOv4-608
2D Classification	COCO test-dev	box mAP	43.5	YOLOv4-608
2D Classification	COCO-O	Average mAP	30.4	YOLOv4-P6
2D Classification	COCO-O	Effective Robustness	5.89	YOLOv4-P6
2D Classification	PKU-DDD17-Car	mAP50	81.3	YOLOv4
2D Classification	COCO (Common Objects in Context)	FPS (V100, b=1)	23	YOLOv4-L
2D Classification	COCO (Common Objects in Context)	box AP	43.5	YOLOv4-L
2D Classification	COCO (Common Objects in Context)	FPS (V100, b=1)	31	YOLOv4-M
2D Classification	COCO (Common Objects in Context)	box AP	43	YOLOv4-M
2D Classification	COCO (Common Objects in Context)	FPS (V100, b=1)	38	YOLOv4-S
2D Classification	COCO (Common Objects in Context)	box AP	41.2	YOLOv4-S
2D Object Detection	COCO test-dev	AP50	65.7	YOLOv4-608
2D Object Detection	COCO test-dev	AP75	47.3	YOLOv4-608
2D Object Detection	COCO test-dev	APL	53.3	YOLOv4-608
2D Object Detection	COCO test-dev	APM	46.7	YOLOv4-608
2D Object Detection	COCO test-dev	APS	26.7	YOLOv4-608
2D Object Detection	COCO test-dev	box mAP	43.5	YOLOv4-608
2D Object Detection	COCO-O	Average mAP	30.4	YOLOv4-P6
2D Object Detection	COCO-O	Effective Robustness	5.89	YOLOv4-P6
2D Object Detection	PKU-DDD17-Car	mAP50	81.3	YOLOv4
2D Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	23	YOLOv4-L
2D Object Detection	COCO (Common Objects in Context)	box AP	43.5	YOLOv4-L
2D Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	31	YOLOv4-M
2D Object Detection	COCO (Common Objects in Context)	box AP	43	YOLOv4-M
2D Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	38	YOLOv4-S
2D Object Detection	COCO (Common Objects in Context)	box AP	41.2	YOLOv4-S
16k	COCO test-dev	AP50	65.7	YOLOv4-608
16k	COCO test-dev	AP75	47.3	YOLOv4-608
16k	COCO test-dev	APL	53.3	YOLOv4-608
16k	COCO test-dev	APM	46.7	YOLOv4-608
16k	COCO test-dev	APS	26.7	YOLOv4-608
16k	COCO test-dev	box mAP	43.5	YOLOv4-608
16k	COCO-O	Average mAP	30.4	YOLOv4-P6
16k	COCO-O	Effective Robustness	5.89	YOLOv4-P6
16k	PKU-DDD17-Car	mAP50	81.3	YOLOv4
16k	COCO (Common Objects in Context)	FPS (V100, b=1)	23	YOLOv4-L
16k	COCO (Common Objects in Context)	box AP	43.5	YOLOv4-L
16k	COCO (Common Objects in Context)	FPS (V100, b=1)	31	YOLOv4-M
16k	COCO (Common Objects in Context)	box AP	43	YOLOv4-M
16k	COCO (Common Objects in Context)	FPS (V100, b=1)	38	YOLOv4-S
16k	COCO (Common Objects in Context)	box AP	41.2	YOLOv4-S

YOLOv4: Optimal Speed and Accuracy of Object Detection

Abstract

Results

Related Papers

YOLOv4: Optimal Speed and Accuracy of Object Detection

Abstract

Results

Related Papers