TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Progressive End-to-End Object Detection in Crowded Scenes

Progressive End-to-End Object Detection in Crowded Scenes

Anlin Zheng, Yuang Zhang, Xiangyu Zhang, Xiaojuan Qi, Jian Sun

2022-03-15CVPR 2022 1object-detectionObject Detection
PaperPDFCode(official)Code

Abstract

In this paper, we propose a new query-based detection framework for crowd detection. Previous query-based detectors suffer from two drawbacks: first, multiple predictions will be inferred for a single object, typically in crowded scenes; second, the performance saturates as the depth of the decoding stage increases. Benefiting from the nature of the one-to-one label assignment rule, we propose a progressive predicting method to address the above issues. Specifically, we first select accepted queries prone to generate true positive predictions, then refine the rest noisy queries according to the previously accepted predictions. Experiments show that our method can significantly boost the performance of query-based detectors in crowded scenes. Equipped with our approach, Sparse RCNN achieves 92.0\% $\text{AP}$, 41.4\% $\text{MR}^{-2}$ and 83.2\% $\text{JI}$ on the challenging CrowdHuman \cite{shao2018crowdhuman} dataset, outperforming the box-based method MIP \cite{chu2020detection} that specifies in handling crowded scenarios. Moreover, the proposed method, robust to crowdedness, can still obtain consistent improvements on moderately and slightly crowded datasets like CityPersons \cite{zhang2017citypersons} and COCO \cite{lin2014microsoft}. Code will be made publicly available at https://github.com/megvii-model/Iter-E2EDET.

Results

TaskDatasetMetricValueModel
Object DetectionCrowdHuman (full body)AP94.1Progressive DETR
Object DetectionCrowdHuman (full body)mMR37.7Progressive DETR
Object DetectionCrowdHumanAP92.5S-RCNN+Ours
Object DetectionCrowdHumanMR^-241.4S-RCNN+Ours
3DCrowdHuman (full body)AP94.1Progressive DETR
3DCrowdHuman (full body)mMR37.7Progressive DETR
3DCrowdHumanAP92.5S-RCNN+Ours
3DCrowdHumanMR^-241.4S-RCNN+Ours
2D ClassificationCrowdHuman (full body)AP94.1Progressive DETR
2D ClassificationCrowdHuman (full body)mMR37.7Progressive DETR
2D ClassificationCrowdHumanAP92.5S-RCNN+Ours
2D ClassificationCrowdHumanMR^-241.4S-RCNN+Ours
2D Object DetectionCrowdHuman (full body)AP94.1Progressive DETR
2D Object DetectionCrowdHuman (full body)mMR37.7Progressive DETR
2D Object DetectionCrowdHumanAP92.5S-RCNN+Ours
2D Object DetectionCrowdHumanMR^-241.4S-RCNN+Ours
16kCrowdHuman (full body)AP94.1Progressive DETR
16kCrowdHuman (full body)mMR37.7Progressive DETR
16kCrowdHumanAP92.5S-RCNN+Ours
16kCrowdHumanMR^-241.4S-RCNN+Ours

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07