TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FusionCount: Efficient Crowd Counting via Multiscale Featu...

FusionCount: Efficient Crowd Counting via Multiscale Feature Fusion

Yiming Ma, Victor Sanchez, Tanaya Guha

2022-02-28Crowd Counting
PaperPDFCode(official)

Abstract

State-of-the-art crowd counting models follow an encoder-decoder approach. Images are first processed by the encoder to extract features. Then, to account for perspective distortion, the highest-level feature map is fed to extra components to extract multiscale features, which are the input to the decoder to generate crowd densities. However, in these methods, features extracted at earlier stages during encoding are underutilised, and the multiscale modules can only capture a limited range of receptive fields, albeit with considerable computational cost. This paper proposes a novel crowd counting architecture (FusionCount), which exploits the adaptive fusion of a large majority of encoded features instead of relying on additional extraction components to obtain multiscale features. Thus, it can cover a more extensive scope of receptive field sizes and lower the computational cost. We also introduce a new channel reduction block, which can extract saliency information during decoding and further enhance the model's performance. Experiments on two benchmark databases demonstrate that our model achieves state-of-the-art results with reduced computational complexity.

Results

TaskDatasetMetricValueModel
CrowdsShanghaiTech BMAE6.9FusionCount
CrowdsShanghaiTech BMSE11.8FusionCount
CrowdsShanghaiTech AMAE62.2FusionCount
CrowdsShanghaiTech AMSE101.2FusionCount

Related Papers

Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework2025-07-11EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression2025-06-24Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting2025-05-28Crowd Scene Analysis using Deep Learning Techniques2025-05-13Transformer-Based Dual-Optical Attention Fusion Crowd Head Point Counting and Localization Network2025-05-11A Short Overview of Multi-Modal Wi-Fi Sensing2025-05-10Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining2025-04-29ProgRoCC: A Progressive Approach to Rough Crowd Counting2025-04-18