YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

Chuyi Li, Lulu Li, Hongliang Jiang, Kaiheng Weng, Yifei Geng, Liang Li, Zaidan Ke, Qingyuan Li, Meng Cheng, Weiqiang Nie, Yiduo Li, Bo Zhang, Yufei Liang, Linyuan Zhou, Xiaoming Xu, Xiangxiang Chu, Xiaoming Wei, Xiaolin Wei

2022-09-07Quantization Real-Time Object Detection Pedestrian Detection Object Detection

Paper PDF Code Code Code Code Code Code(official)Code

Abstract

For years, the YOLO series has been the de facto industry-level standard for efficient object detection. The YOLO community has prospered overwhelmingly to enrich its use in a multitude of hardware platforms and abundant scenarios. In this technical report, we strive to push its limits to the next level, stepping forward with an unwavering mindset for industry application. Considering the diverse requirements for speed and accuracy in the real environment, we extensively examine the up-to-date object detection advancements either from industry or academia. Specifically, we heavily assimilate ideas from recent network design, training strategies, testing techniques, quantization, and optimization methods. On top of this, we integrate our thoughts and practice to build a suite of deployment-ready networks at various scales to accommodate diversified use cases. With the generous permission of YOLO authors, we name it YOLOv6. We also express our warm welcome to users and contributors for further enhancement. For a glimpse of performance, our YOLOv6-N hits 35.9% AP on the COCO dataset at a throughput of 1234 FPS on an NVIDIA Tesla T4 GPU. YOLOv6-S strikes 43.5% AP at 495 FPS, outperforming other mainstream detectors at the same scale~(YOLOv5-S, YOLOX-S, and PPYOLOE-S). Our quantized version of YOLOv6-S even brings a new state-of-the-art 43.3% AP at 869 FPS. Furthermore, YOLOv6-M/L also achieves better accuracy performance (i.e., 49.5%/52.3%) than other detectors with a similar inference speed. We carefully conducted experiments to validate the effectiveness of each component. Our code is made available at https://github.com/meituan/YOLOv6.

Results

Task	Dataset	Metric	Value	Model
Autonomous Vehicles	DVTOD	mAP	84.4	YOLOv6 (Thermal)
Autonomous Vehicles	DVTOD	mAP	38.1	YOLOv6 (Visible)
Object Detection	COCO-O	Average mAP	32.5	YOLOv6-L6
Object Detection	COCO-O	Effective Robustness	6.73	YOLOv6-L6
Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	26	YOLOv6-L6(1280)
Object Detection	COCO (Common Objects in Context)	box AP	57.2	YOLOv6-L6(1280)
3D	COCO-O	Average mAP	32.5	YOLOv6-L6
3D	COCO-O	Effective Robustness	6.73	YOLOv6-L6
3D	COCO (Common Objects in Context)	FPS (V100, b=1)	26	YOLOv6-L6(1280)
3D	COCO (Common Objects in Context)	box AP	57.2	YOLOv6-L6(1280)
2D Classification	COCO-O	Average mAP	32.5	YOLOv6-L6
2D Classification	COCO-O	Effective Robustness	6.73	YOLOv6-L6
2D Classification	COCO (Common Objects in Context)	FPS (V100, b=1)	26	YOLOv6-L6(1280)
2D Classification	COCO (Common Objects in Context)	box AP	57.2	YOLOv6-L6(1280)
Pedestrian Detection	DVTOD	mAP	84.4	YOLOv6 (Thermal)
Pedestrian Detection	DVTOD	mAP	38.1	YOLOv6 (Visible)
2D Object Detection	COCO-O	Average mAP	32.5	YOLOv6-L6
2D Object Detection	COCO-O	Effective Robustness	6.73	YOLOv6-L6
2D Object Detection	COCO (Common Objects in Context)	FPS (V100, b=1)	26	YOLOv6-L6(1280)
2D Object Detection	COCO (Common Objects in Context)	box AP	57.2	YOLOv6-L6(1280)
16k	COCO-O	Average mAP	32.5	YOLOv6-L6
16k	COCO-O	Effective Robustness	6.73	YOLOv6-L6
16k	COCO (Common Objects in Context)	FPS (V100, b=1)	26	YOLOv6-L6(1280)
16k	COCO (Common Objects in Context)	box AP	57.2	YOLOv6-L6(1280)

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

Abstract

Results

Related Papers

YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications

Abstract

Results

Related Papers