TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLOv10: Real-Time End-to-End Object Detection

YOLOv10: Real-Time End-to-End Object Detection

Ao Wang, Hui Chen, Lihao Liu, Kai Chen, Zijia Lin, Jungong Han, Guiguang Ding

2024-05-23Data AugmentationReal-Time Object Detection2D Object Detectionobject-detectionObject Detection
PaperPDFCodeCode(official)Code

Abstract

Over the past years, YOLOs have emerged as the predominant paradigm in the field of real-time object detection owing to their effective balance between computational cost and detection performance. Researchers have explored the architectural designs, optimization objectives, data augmentation strategies, and others for YOLOs, achieving notable progress. However, the reliance on the non-maximum suppression (NMS) for post-processing hampers the end-to-end deployment of YOLOs and adversely impacts the inference latency. Besides, the design of various components in YOLOs lacks the comprehensive and thorough inspection, resulting in noticeable computational redundancy and limiting the model's capability. It renders the suboptimal efficiency, along with considerable potential for performance improvements. In this work, we aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture. To this end, we first present the consistent dual assignments for NMS-free training of YOLOs, which brings competitive performance and low inference latency simultaneously. Moreover, we introduce the holistic efficiency-accuracy driven model design strategy for YOLOs. We comprehensively optimize various components of YOLOs from both efficiency and accuracy perspectives, which greatly reduces the computational overhead and enhances the capability. The outcome of our effort is a new generation of YOLO series for real-time end-to-end object detection, dubbed YOLOv10. Extensive experiments show that YOLOv10 achieves state-of-the-art performance and efficiency across various model scales. For example, our YOLOv10-S is 1.8$\times$ faster than RT-DETR-R18 under the similar AP on COCO, meanwhile enjoying 2.8$\times$ smaller number of parameters and FLOPs. Compared with YOLOv9-C, YOLOv10-B has 46\% less latency and 25\% fewer parameters for the same performance.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO (Common Objects in Context)box AP54.4YOLOv10-X
Object DetectionCOCO (Common Objects in Context)box AP53.4YOLOv10-L
Object DetectionCOCO (Common Objects in Context)box AP52.7YOLOv10-B
Object DetectionCOCO (Common Objects in Context)box AP51.3YOLOv10-M
Object DetectionCOCO (Common Objects in Context)box AP46.8YOLOv10-S
Object DetectionCOCO (Common Objects in Context)box AP39.5YOLOv10-N
3DCOCO (Common Objects in Context)box AP54.4YOLOv10-X
3DCOCO (Common Objects in Context)box AP53.4YOLOv10-L
3DCOCO (Common Objects in Context)box AP52.7YOLOv10-B
3DCOCO (Common Objects in Context)box AP51.3YOLOv10-M
3DCOCO (Common Objects in Context)box AP46.8YOLOv10-S
3DCOCO (Common Objects in Context)box AP39.5YOLOv10-N
2D ClassificationCOCO (Common Objects in Context)box AP54.4YOLOv10-X
2D ClassificationCOCO (Common Objects in Context)box AP53.4YOLOv10-L
2D ClassificationCOCO (Common Objects in Context)box AP52.7YOLOv10-B
2D ClassificationCOCO (Common Objects in Context)box AP51.3YOLOv10-M
2D ClassificationCOCO (Common Objects in Context)box AP46.8YOLOv10-S
2D ClassificationCOCO (Common Objects in Context)box AP39.5YOLOv10-N
2D Object DetectionCOCO (Common Objects in Context)box AP54.4YOLOv10-X
2D Object DetectionCOCO (Common Objects in Context)box AP53.4YOLOv10-L
2D Object DetectionCOCO (Common Objects in Context)box AP52.7YOLOv10-B
2D Object DetectionCOCO (Common Objects in Context)box AP51.3YOLOv10-M
2D Object DetectionCOCO (Common Objects in Context)box AP46.8YOLOv10-S
2D Object DetectionCOCO (Common Objects in Context)box AP39.5YOLOv10-N
16kCOCO (Common Objects in Context)box AP54.4YOLOv10-X
16kCOCO (Common Objects in Context)box AP53.4YOLOv10-L
16kCOCO (Common Objects in Context)box AP52.7YOLOv10-B
16kCOCO (Common Objects in Context)box AP51.3YOLOv10-M
16kCOCO (Common Objects in Context)box AP46.8YOLOv10-S
16kCOCO (Common Objects in Context)box AP39.5YOLOv10-N

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16