TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/V2F-Net: Explicit Decomposition of Occluded Pedestrian Det...

V2F-Net: Explicit Decomposition of Occluded Pedestrian Detection

Mingyang Shang, Dawei Xiang, Zhicheng Wang, Erjin Zhou

2021-04-07Pedestrian DetectionObject Detection
PaperPDF

Abstract

Occlusion is very challenging in pedestrian detection. In this paper, we propose a simple yet effective method named V2F-Net, which explicitly decomposes occluded pedestrian detection into visible region detection and full body estimation. V2F-Net consists of two sub-networks: Visible region Detection Network (VDN) and Full body Estimation Network (FEN). VDN tries to localize visible regions and FEN estimates full-body box on the basis of the visible box. Moreover, to further improve the estimation of full body, we propose a novel Embedding-based Part-aware Module (EPM). By supervising the visibility for each part, the network is encouraged to extract features with essential part information. We experimentally show the effectiveness of V2F-Net by conducting several experiments on two challenging datasets. V2F-Net achieves 5.85% AP gains on CrowdHuman and 2.24% MR-2 improvements on CityPersons compared to FPN baseline. Besides, the consistent gain on both one-stage and two-stage detector validates the generalizability of our method.

Results

TaskDatasetMetricValueModel
Object DetectionCrowdHuman (full body)AP91.03V2F-Net
Object DetectionCrowdHuman (full body)Recall84.2V2F-Net
Object DetectionCrowdHuman (full body)mMR42.28V2F-Net
Object DetectionCityPersonsmMR10.08V2F-Net
3DCrowdHuman (full body)AP91.03V2F-Net
3DCrowdHuman (full body)Recall84.2V2F-Net
3DCrowdHuman (full body)mMR42.28V2F-Net
3DCityPersonsmMR10.08V2F-Net
2D ClassificationCrowdHuman (full body)AP91.03V2F-Net
2D ClassificationCrowdHuman (full body)Recall84.2V2F-Net
2D ClassificationCrowdHuman (full body)mMR42.28V2F-Net
2D ClassificationCityPersonsmMR10.08V2F-Net
2D Object DetectionCrowdHuman (full body)AP91.03V2F-Net
2D Object DetectionCrowdHuman (full body)Recall84.2V2F-Net
2D Object DetectionCrowdHuman (full body)mMR42.28V2F-Net
2D Object DetectionCityPersonsmMR10.08V2F-Net
16kCrowdHuman (full body)AP91.03V2F-Net
16kCrowdHuman (full body)Recall84.2V2F-Net
16kCrowdHuman (full body)mMR42.28V2F-Net
16kCityPersonsmMR10.08V2F-Net

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08YOLO-APD: Enhancing YOLOv8 for Robust Pedestrian Detection on Complex Road Geometries2025-07-07