Drone-based Object Counting by Spatially Regularized Regional Proposal Network

Meng-Ru Hsieh, Yen-Liang Lin, Winston H. Hsu

2017-07-19 | ICCV 2017 10 | Region Proposal | Object Counting

Abstract

Existing counting methods often adopt regression-based approaches and cannot precisely localize the target objects, which hinders further analysis (e.g., high-level understanding and fine-grained classification). In addition, most prior work focuses on counting objects in static environments with fixed cameras. Motivated by the advent of unmanned flying vehicles (i.e., drones), we are interested in detecting and counting objects in such dynamic environments. We propose Layout Proposal Networks (LPNs) and spatial kernels to simultaneously count and localize target objects (e.g., cars) in videos recorded by drones. Unlike conventional region proposal methods, we leverage spatial layout information (e.g., cars are often parked in regular patterns) and introduce these spatially regularized constraints into our network to improve localization accuracy. To evaluate our counting method, we present a new large-scale car parking lot dataset (CARPK) that contains nearly 90,000 cars captured from different parking lots. To the best of our knowledge, it is the first and the largest drone-view dataset that supports object counting and provides bounding box annotations.

Results

Task             Dataset  Metric  Value  Model
Object Counting  CARPK    MAE     16.62  RetinaNet (2018)
Object Counting  CARPK    RMSE    22.3   RetinaNet (2018)
Object Counting  CARPK    MAE     22.76  LPN Counting (2017)
Object Counting  CARPK    RMSE    34.46  LPN Counting (2017)
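
The page reports MAE and RMSE without defining them. As a rough sketch, assuming the usual per-image definitions for counting benchmarks (mean absolute error and root mean squared error between predicted and ground-truth counts), the two metrics could be computed as below. The function names and the sample counts are illustrative only; they are not taken from the paper or from CARPK.

```python
import math


def mae(pred_counts, true_counts):
    """Mean absolute error between predicted and true per-image counts."""
    if len(pred_counts) != len(true_counts) or not true_counts:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum(abs(p - t) for p, t in zip(pred_counts, true_counts))
    return total / len(true_counts)


def rmse(pred_counts, true_counts):
    """Root mean squared error between predicted and true per-image counts."""
    if len(pred_counts) != len(true_counts) or not true_counts:
        raise ValueError("inputs must be non-empty and of equal length")
    total = sum((p - t) ** 2 for p, t in zip(pred_counts, true_counts))
    return math.sqrt(total / len(true_counts))


# Hypothetical counts for three images (not real CARPK data).
predicted = [48, 52, 40]
actual = [50, 50, 44]

print(f"MAE:  {mae(predicted, actual):.3f}")
print(f"RMSE: {rmse(predicted, actual):.3f}")
```

Because RMSE squares each error before averaging, it penalizes a few large miscounts more heavily than MAE does, so RMSE is never smaller than MAE on the same predictions.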

Related Papers

Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework (2025-07-11)
Bridging Annotation Gaps: Transferring Labels to Align Object Detection Datasets (2025-06-05)
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models (2025-06-03)
Improving Contrastive Learning for Referring Expression Counting (2025-05-28)
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition (2025-05-21)
Expanding Zero-Shot Object Counting with Rich Prompts (2025-05-21)
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning (2025-05-17)
Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? (2025-05-17)