TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bringing Generalization to Deep Multi-View Pedestrian Dete...

Bringing Generalization to Deep Multi-View Pedestrian Detection

Jeet Vora, Swetanjal Dutta, Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi

2021-09-24Multiview DetectionPedestrian Detection
PaperPDFCode(official)

Abstract

Multi-view Detection (MVD) is highly effective for occlusion reasoning in a crowded environment. While recent works using deep learning have made significant advances in the field, they have overlooked the generalization aspect, which makes them impractical for real-world deployment. The key novelty of our work is to formalize three critical forms of generalization and propose experiments to evaluate them: generalization with i) a varying number of cameras, ii) varying camera positions, and finally, iii) to new scenes. We find that existing state-of-the-art models show poor generalization by overfitting to a single scene and camera configuration. To address the concerns: (a) we propose a novel Generalized MVD (GMVD) dataset, assimilating diverse scenes with changing daytime, camera configurations, varying number of cameras, and (b) we discuss the properties essential to bring generalization to MVD and propose a barebones model to incorporate them. We perform a comprehensive set of experiments on the WildTrack, MultiViewX, and the GMVD datasets to motivate the necessity to evaluate the generalization abilities of MVD methods and to demonstrate the efficacy of the proposed approach. The code and the proposed dataset can be found at https://github.com/jeetv/GMVD

Results

TaskDatasetMetricValueModel
Object DetectionGMVDMODA68.2GMVD
Object DetectionGMVDRecall75.5GMVD
3DGMVDMODA68.2GMVD
3DGMVDRecall75.5GMVD
3D Object DetectionGMVDMODA68.2GMVD
3D Object DetectionGMVDRecall75.5GMVD
2D ClassificationGMVDMODA68.2GMVD
2D ClassificationGMVDRecall75.5GMVD
2D Object DetectionGMVDMODA68.2GMVD
2D Object DetectionGMVDRecall75.5GMVD
16kGMVDMODA68.2GMVD
16kGMVDRecall75.5GMVD

Related Papers

YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras2025-05-23Attention-Aware Multi-View Pedestrian Tracking2025-04-03Panoramic Distortion-Aware Tokenization for Person Detection and Localization Using Transformers in Overhead Fisheye Images2025-03-18Enhanced Multi-View Pedestrian Detection Using Probabilistic Occupancy Volume2025-03-14Adversarial Attacks on Event-Based Pedestrian Detectors: A Physical Approach2025-03-01PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments2025-02-21PedDet: Adaptive Spectral Optimization for Multimodal Pedestrian Detection2025-02-19