TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLOv12: A Breakdown of the Key Architectural Features

YOLOv12: A Breakdown of the Key Architectural Features

Mujadded Al Rabbani Alif, Muhammad Hussain

2025-02-20Real-Time Object Detectionobject-detectionObject Detection
PaperPDF

Abstract

This paper presents an architectural analysis of YOLOv12, a significant advancement in single-stage, real-time object detection building upon the strengths of its predecessors while introducing key improvements. The model incorporates an optimised backbone (R-ELAN), 7x7 separable convolutions, and FlashAttention-driven area-based attention, improving feature extraction, enhanced efficiency, and robust detections. With multiple model variants, similar to its predecessors, YOLOv12 offers scalable solutions for both latency-sensitive and high-accuracy applications. Experimental results manifest consistent gains in mean average precision (mAP) and inference speed, making YOLOv12 a compelling choice for applications in autonomous systems, security, and real-time analytics. By achieving an optimal balance between computational efficiency and performance, YOLOv12 sets a new benchmark for real-time computer vision, facilitating deployment across diverse hardware platforms, from edge devices to high-performance clusters.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO (Common Objects in Context)box AP55.2YOLOv12x
Object DetectionCOCO (Common Objects in Context)box AP53.7YOLOv12l
Object DetectionCOCO (Common Objects in Context)box AP52.5YOLOv12m
Object DetectionCOCO (Common Objects in Context)box AP48YOLOv12s
Object DetectionCOCO (Common Objects in Context)box AP40.6YOLOv12n
3DCOCO (Common Objects in Context)box AP55.2YOLOv12x
3DCOCO (Common Objects in Context)box AP53.7YOLOv12l
3DCOCO (Common Objects in Context)box AP52.5YOLOv12m
3DCOCO (Common Objects in Context)box AP48YOLOv12s
3DCOCO (Common Objects in Context)box AP40.6YOLOv12n
2D ClassificationCOCO (Common Objects in Context)box AP55.2YOLOv12x
2D ClassificationCOCO (Common Objects in Context)box AP53.7YOLOv12l
2D ClassificationCOCO (Common Objects in Context)box AP52.5YOLOv12m
2D ClassificationCOCO (Common Objects in Context)box AP48YOLOv12s
2D ClassificationCOCO (Common Objects in Context)box AP40.6YOLOv12n
2D Object DetectionCOCO (Common Objects in Context)box AP55.2YOLOv12x
2D Object DetectionCOCO (Common Objects in Context)box AP53.7YOLOv12l
2D Object DetectionCOCO (Common Objects in Context)box AP52.5YOLOv12m
2D Object DetectionCOCO (Common Objects in Context)box AP48YOLOv12s
2D Object DetectionCOCO (Common Objects in Context)box AP40.6YOLOv12n
16kCOCO (Common Objects in Context)box AP55.2YOLOv12x
16kCOCO (Common Objects in Context)box AP53.7YOLOv12l
16kCOCO (Common Objects in Context)box AP52.5YOLOv12m
16kCOCO (Common Objects in Context)box AP48YOLOv12s
16kCOCO (Common Objects in Context)box AP40.6YOLOv12n

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07