Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

IterDet: Iterative Scheme for Object Detection in Crowded Environments

Danila Rukhovich, Konstantin Sofiiuk, Danil Galeev, Olga Barinova, Anton Konushin

2020-05-12 | Object Detection

Abstract

Deep learning-based detectors usually produce a redundant set of object bounding boxes including many duplicate detections of the same object. These boxes are then filtered using non-maximum suppression (NMS) in order to select exactly one bounding box per object of interest. This greedy scheme is simple and provides sufficient accuracy for isolated objects but often fails in crowded environments, since one needs to both preserve boxes for different objects and suppress duplicate detections. In this work we develop an alternative iterative scheme, where a new subset of objects is detected at each iteration. Detected boxes from the previous iterations are passed to the network at the following iterations to ensure that the same object would not be detected twice. This iterative scheme can be applied to both one-stage and two-stage object detectors with just minor modifications of the training and inference procedures. We perform extensive experiments with two different baseline detectors on four datasets and show significant improvement over the baseline, leading to state-of-the-art performance on CrowdHuman and WiderPerson datasets. The source code and the trained models are available at https://github.com/saic-vul/iterdet.
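
The contrast between single-pass NMS and the iterative scheme described in the abstract can be sketched on a toy example. This is a minimal illustration, not the authors' implementation: the `detect(image, history)` callable, the `toy_detect` stand-in, the box coordinates, and the choice to represent the history as a plain list of boxes are all assumptions made here, and the abstract does not say how the network encodes previously detected boxes or how NMS is used within an iteration.

```python
from typing import Callable, List, Tuple

# (x1, y1, x2, y2, score); this box layout is an assumption for the sketch.
Box = Tuple[float, float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes: List[Box], iou_thr: float = 0.5) -> List[Box]:
    """Greedy NMS: visit boxes by descending score, drop any box that
    overlaps an already kept box by more than iou_thr."""
    kept: List[Box] = []
    for box in sorted(boxes, key=lambda b: b[4], reverse=True):
        if all(iou(box, k) <= iou_thr for k in kept):
            kept.append(box)
    return kept


def iterative_detect(
    detect: Callable[[object, List[Box]], List[Box]],
    image: object,
    n_iterations: int = 2,
    iou_thr: float = 0.5,
) -> List[Box]:
    """Run a history-aware detector several times, accumulating boxes.

    `detect` is a hypothetical callable that receives the image and the
    boxes found so far and returns candidates for objects not yet found.
    NMS is applied only among the candidates of a single iteration.
    """
    history: List[Box] = []
    for _ in range(n_iterations):
        new_boxes = greedy_nms(detect(image, history), iou_thr)
        if not new_boxes:
            break  # nothing new was found, stop early
        history.extend(new_boxes)
    return history


# Two heavily overlapping people (IoU about 0.54) and a duplicate of the first.
A: Box = (0.0, 0.0, 10.0, 20.0, 0.9)
A_DUP: Box = (0.5, 0.0, 10.5, 20.0, 0.8)
B: Box = (3.0, 0.0, 13.0, 20.0, 0.7)


def toy_detect(image: object, history: List[Box]) -> List[Box]:
    """Hand-written stand-in for a trained, history-aware detector."""
    if not history:
        return [A, A_DUP]
    if B not in history:
        return [B]
    return []


single_pass = greedy_nms([A, A_DUP, B])                      # B is suppressed with A_DUP
iterative = iterative_detect(toy_detect, image=None, n_iterations=2)
print(len(single_pass), len(iterative))  # prints "1 2"
```

In this toy setup, single-pass NMS at a 0.5 threshold cannot tell the second person from a duplicate, while the iterative loop recovers both boxes because the second object is proposed in a separate iteration.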

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Object Detection | CrowdHuman (full body) | AP | 88.08 | IterDet (Faster RCNN, ResNet50, 2 iterations) |
| Object Detection | CrowdHuman (full body) | mMR | 49.44 | IterDet (Faster RCNN, ResNet50, 2 iterations) |
| Object Detection | CrowdHuman (full body) | AP | 84.43 | IterDet (Faster RCNN, ResNet50, 1 iteration) |
| Object Detection | CrowdHuman (full body) | mMR | 49.12 | IterDet (Faster RCNN, ResNet50, 1 iteration) |
| Object Detection | WiderPerson | AP | 91.95 | IterDet (Faster RCNN, ResNet50, 2 iterations) |
| Object Detection | WiderPerson | mMR | 40.78 | IterDet (Faster RCNN, ResNet50, 2 iterations) |
| Object Detection | WiderPerson | AP | 89.49 | IterDet (Faster RCNN, ResNet50, 1 iteration) |
| Object Detection | WiderPerson | mMR | 40.35 | IterDet (Faster RCNN, ResNet50, 1 iteration) |

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)