TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-View People Detection in Large Scenes via Supervised...

Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting

Qi Zhang, Yunfei Gong, Daijie Chen, Antoni B. Chan, Hui Huang

2024-05-30Camera CalibrationMultiview DetectionDomain Adaptation
PaperPDFCode(official)

Abstract

Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and camera calibration errors. This paper focuses on improving multi-view people detection by developing a supervised view-wise contribution weighting approach that better fuses multi-camera information under large scenes. Besides, a large synthetic dataset is adopted to enhance the model's generalization ability and enable more practical evaluation and comparison. The model's performance on new testing scenes is further improved with a simple domain adaptation technique. Experimental results demonstrate the effectiveness of our approach in achieving promising cross-scene multi-view people detection performance. See code here: https://vcc.tech/research/2024/MVD.

Results

TaskDatasetMetricValueModel
Object DetectionCityStreetF1_score (2m)76SVCW
Object DetectionCityStreetMODA (2m)55SVCW
Object DetectionCityStreetMODP (2m)70SVCW
Object DetectionCityStreetPrecision (2m)81.4SVCW
Object DetectionCityStreetRecall (2m)71.2SVCW
Object DetectionCVCSF1_score (1m)68.4SVCW
Object DetectionCVCSMODA (1m)46.2SVCW
Object DetectionCVCSMODP (1m)78.4SVCW
Object DetectionCVCSPrecision (1m)81.2SVCW
Object DetectionCVCSRecall (1m)59.1SVCW
3DCityStreetF1_score (2m)76SVCW
3DCityStreetMODA (2m)55SVCW
3DCityStreetMODP (2m)70SVCW
3DCityStreetPrecision (2m)81.4SVCW
3DCityStreetRecall (2m)71.2SVCW
3DCVCSF1_score (1m)68.4SVCW
3DCVCSMODA (1m)46.2SVCW
3DCVCSMODP (1m)78.4SVCW
3DCVCSPrecision (1m)81.2SVCW
3DCVCSRecall (1m)59.1SVCW
3D Object DetectionCityStreetF1_score (2m)76SVCW
3D Object DetectionCityStreetMODA (2m)55SVCW
3D Object DetectionCityStreetMODP (2m)70SVCW
3D Object DetectionCityStreetPrecision (2m)81.4SVCW
3D Object DetectionCityStreetRecall (2m)71.2SVCW
3D Object DetectionCVCSF1_score (1m)68.4SVCW
3D Object DetectionCVCSMODA (1m)46.2SVCW
3D Object DetectionCVCSMODP (1m)78.4SVCW
3D Object DetectionCVCSPrecision (1m)81.2SVCW
3D Object DetectionCVCSRecall (1m)59.1SVCW
2D ClassificationCityStreetF1_score (2m)76SVCW
2D ClassificationCityStreetMODA (2m)55SVCW
2D ClassificationCityStreetMODP (2m)70SVCW
2D ClassificationCityStreetPrecision (2m)81.4SVCW
2D ClassificationCityStreetRecall (2m)71.2SVCW
2D ClassificationCVCSF1_score (1m)68.4SVCW
2D ClassificationCVCSMODA (1m)46.2SVCW
2D ClassificationCVCSMODP (1m)78.4SVCW
2D ClassificationCVCSPrecision (1m)81.2SVCW
2D ClassificationCVCSRecall (1m)59.1SVCW
2D Object DetectionCityStreetF1_score (2m)76SVCW
2D Object DetectionCityStreetMODA (2m)55SVCW
2D Object DetectionCityStreetMODP (2m)70SVCW
2D Object DetectionCityStreetPrecision (2m)81.4SVCW
2D Object DetectionCityStreetRecall (2m)71.2SVCW
2D Object DetectionCVCSF1_score (1m)68.4SVCW
2D Object DetectionCVCSMODA (1m)46.2SVCW
2D Object DetectionCVCSMODP (1m)78.4SVCW
2D Object DetectionCVCSPrecision (1m)81.2SVCW
2D Object DetectionCVCSRecall (1m)59.1SVCW
16kCityStreetF1_score (2m)76SVCW
16kCityStreetMODA (2m)55SVCW
16kCityStreetMODP (2m)70SVCW
16kCityStreetPrecision (2m)81.4SVCW
16kCityStreetRecall (2m)71.2SVCW
16kCVCSF1_score (1m)68.4SVCW
16kCVCSMODA (1m)46.2SVCW
16kCVCSMODP (1m)78.4SVCW
16kCVCSPrecision (1m)81.2SVCW
16kCVCSRecall (1m)59.1SVCW

Related Papers

A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation2025-07-14An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation2025-07-11The Bayesian Approach to Continual Learning: An Overview2025-07-11Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble Fusion2025-07-04Underwater Monocular Metric Depth Estimation: Real-World Benchmarks and Synthetic Fine-Tuning2025-07-02