Drone-based Object Counting by Spatially Regularized Regional Proposal Network

Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu

2017-07-19 | ICCV 2017 10 | Region Proposal | Object Counting

Abstract

Existing counting methods often adopt regression-based approaches and cannot precisely localize the target objects, which hinders further analysis (e.g., high-level understanding and fine-grained classification). In addition, most prior work focuses on counting objects in static environments with fixed cameras. Motivated by the advent of unmanned flying vehicles (i.e., drones), we are interested in detecting and counting objects in such dynamic environments. We propose Layout Proposal Networks (LPNs) and spatial kernels to simultaneously count and localize target objects (e.g., cars) in videos recorded by drones. Unlike conventional region proposal methods, we leverage spatial layout information (e.g., cars are often parked in regular patterns) and introduce these spatially regularized constraints into our network to improve localization accuracy. To evaluate our counting method, we present a new large-scale car parking lot dataset (CARPK) that contains nearly 90,000 cars captured from different parking lots. To the best of our knowledge, it is the first and largest drone-view dataset that supports object counting and provides bounding box annotations.

Results

Task	Dataset	Metric	Value	Model
Object Counting	CARPK	MAE	16.62	RetinaNet (2018)
Object Counting	CARPK	RMSE	22.3	RetinaNet (2018)
Object Counting	CARPK	MAE	22.76	LPN Counting (2017)
Object Counting	CARPK	RMSE	34.46	LPN Counting (2017)
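
The MAE and RMSE values above are per-image counting errors, and lower is better. As a minimal sketch of how such metrics are usually computed, assuming the standard definitions of mean absolute error and root mean squared error (the function name and the sample counts below are hypothetical and are not taken from the paper or from CARPK):

```python
import math


def counting_errors(predicted, actual):
    """Return (MAE, RMSE) over paired per-image object counts.

    Assumes the standard definitions:
      MAE  = mean(|predicted_i - actual_i|)
      RMSE = sqrt(mean((predicted_i - actual_i) ** 2))
    """
    if len(predicted) != len(actual) or not predicted:
        raise ValueError("need two non-empty sequences of equal length")
    diffs = [p - a for p, a in zip(predicted, actual)]
    mae = sum(abs(d) for d in diffs) / len(diffs)
    rmse = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    return mae, rmse


# Hypothetical counts for three images (illustrative only).
predicted = [98, 105, 47]
actual = [100, 100, 50]
mae, rmse = counting_errors(predicted, actual)
print(f"MAE={mae:.2f} RMSE={rmse:.2f}")
```

Because RMSE squares each error before averaging, it penalizes large per-image miscounts more heavily than MAE and is never smaller than MAE on the same data, which is consistent with the rows above.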
