TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/YOLO-Drone:Airborne real-time detection of dense small obj...

YOLO-Drone:Airborne real-time detection of dense small objects from high-altitude perspective

Li Zhu, Jiahui Xiong, Feng Xiong, Hanzheng Hu, Zhengnan Jiang

2023-04-14Real-Time Object Detectionobject-detectionObject Detection
PaperPDF

Abstract

Unmanned Aerial Vehicles (UAVs), specifically drones equipped with remote sensing object detection technology, have rapidly gained a broad spectrum of applications and emerged as one of the primary research focuses in the field of computer vision. Although UAV remote sensing systems have the ability to detect various objects, small-scale objects can be challenging to detect reliably due to factors such as object size, image degradation, and real-time limitations. To tackle these issues, a real-time object detection algorithm (YOLO-Drone) is proposed and applied to two new UAV platforms as well as a specific light source (silicon-based golden LED). YOLO-Drone presents several novelties: 1) including a new backbone Darknet59; 2) a new complex feature aggregation module MSPP-FPN that incorporated one spatial pyramid pooling and three atrous spatial pyramid pooling modules; 3) and the use of Generalized Intersection over Union (GIoU) as the loss function. To evaluate performance, two benchmark datasets, UAVDT and VisDrone, along with one homemade dataset acquired at night under silicon-based golden LEDs, are utilized. The experimental results show that, in both UAVDT and VisDrone, the proposed YOLO-Drone outperforms state-of-the-art (SOTA) object detection methods by improving the mAP of 10.13% and 8.59%, respectively. With regards to UAVDT, the YOLO-Drone exhibits both high real-time inference speed of 53 FPS and a maximum mAP of 34.04%. Notably, YOLO-Drone achieves high performance under the silicon-based golden LEDs, with a mAP of up to 87.71%, surpassing the performance of YOLO series under ordinary light sources. To conclude, the proposed YOLO-Drone is a highly effective solution for object detection in UAV applications, particularly for night detection tasks where silicon-based golden light LED technology exhibits significant superiority.

Results

TaskDatasetMetricValueModel
Object DetectionIndia Driving DatasetmAP@0.530.3YOLOv5x
Object DetectionIndia Driving DatasetmAP@0.559.87YOLO-Drone
Object DetectionCOCO 2017mAP35.45YOLO-Drone
3DIndia Driving DatasetmAP@0.530.3YOLOv5x
3DIndia Driving DatasetmAP@0.559.87YOLO-Drone
3DCOCO 2017mAP35.45YOLO-Drone
2D ClassificationIndia Driving DatasetmAP@0.530.3YOLOv5x
2D ClassificationIndia Driving DatasetmAP@0.559.87YOLO-Drone
2D ClassificationCOCO 2017mAP35.45YOLO-Drone
2D Object DetectionIndia Driving DatasetmAP@0.530.3YOLOv5x
2D Object DetectionIndia Driving DatasetmAP@0.559.87YOLO-Drone
2D Object DetectionCOCO 2017mAP35.45YOLO-Drone
16kIndia Driving DatasetmAP@0.530.3YOLOv5x
16kIndia Driving DatasetmAP@0.559.87YOLO-Drone
16kCOCO 2017mAP35.45YOLO-Drone

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07