TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Divide and Grow: Capturing Huge Diversity in Crowd Images ...

Divide and Grow: Capturing Huge Diversity in Crowd Images with Incrementally Growing CNN

Deepak Babu Sam, Neeraj N Sajjan, R. Venkatesh Babu

2018-07-26CVPR 2018 6Crowd Counting
PaperPDF

Abstract

Automated counting of people in crowd images is a challenging task. The major difficulty stems from the large diversity in the way people appear in crowds. In fact, features available for crowd discrimination largely depend on the crowd density to the extent that people are only seen as blobs in a highly dense scene. We tackle this problem with a growing CNN which can progressively increase its capacity to account for the wide variability seen in crowd scenes. Our model starts from a base CNN density regressor, which is trained in equivalence on all types of crowd images. In order to adapt with the huge diversity, we create two child regressors which are exact copies of the base CNN. A differential training procedure divides the dataset into two clusters and fine-tunes the child networks on their respective specialties. Consequently, without any hand-crafted criteria for forming specialties, the child regressors become experts on certain types of crowds. The child networks are again split recursively, creating two experts at every division. This hierarchical training leads to a CNN tree, where the child regressors are more fine experts than any of their parents. The leaf nodes are taken as the final experts and a classifier network is then trained to predict the correct specialty for a given test image patch. The proposed model achieves higher count accuracy on major crowd datasets. Further, we analyse the characteristics of specialties mined automatically by our method.

Results

TaskDatasetMetricValueModel
CrowdsShanghaiTech BMAE13.6IG-CNN
CrowdsShanghaiTech AMAE72.5IG-CNN
CrowdsUCF CC 50MAE291.4IG-CNN
CrowdsWorldExpo’10Average MAE11.3IG-CNN

Related Papers

Car Object Counting and Position Estimation via Extension of the CLIP-EBC Framework2025-07-11EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression2025-06-24Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting2025-05-28Crowd Scene Analysis using Deep Learning Techniques2025-05-13Transformer-Based Dual-Optical Attention Fusion Crowd Head Point Counting and Localization Network2025-05-11A Short Overview of Multi-Modal Wi-Fi Sensing2025-05-10Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining2025-04-29ProgRoCC: A Progressive Approach to Rough Crowd Counting2025-04-18