3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

Rui Qiu, Ming Xu, Yuyao Yan, Jeremy S. Smith, Xi Yang

2022-07-22 | Multiview Detection | Data Augmentation | Pedestrian Detection

Abstract

Although deep-learning-based methods for monocular pedestrian detection have made great progress, they remain vulnerable to heavy occlusion. Multi-view information fusion is a potential solution, but its application has been limited by the lack of annotated training samples in existing multi-view datasets, which increases the risk of overfitting. To address this problem, a data augmentation method is proposed that randomly generates 3D cylinder occlusions of average pedestrian size on the ground plane and projects them into multiple views, reducing the impact of overfitting during training. Moreover, the feature map of each view is projected by homographies onto multiple parallel planes at different heights, which allows the CNNs to fully utilize features across the height of each pedestrian when inferring pedestrian locations on the ground plane. The proposed 3DROM method achieves greatly improved performance compared with state-of-the-art deep-learning-based methods for multi-view pedestrian detection.
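The multi-layer projection described above relies on the fact that, for a calibrated camera, points on any plane parallel to the ground map to the image through a 3x3 homography. The following is a minimal sketch of that relationship, not the authors' implementation. It assumes a known 3x4 projection matrix `P`; the camera values, the layer heights, and the helper names `plane_homography`, `apply_homography`, and `project_point` are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple

Matrix = List[List[float]]


def plane_homography(P: Sequence[Sequence[float]], z: float) -> Matrix:
    """Return the 3x3 homography from plane coordinates (X, Y, 1) at height z to pixels.

    With Z fixed to z, P @ [X, Y, z, 1] equals [p1, p2, z*p3 + p4] @ [X, Y, 1],
    so the third and fourth columns of P collapse into a single column.
    """
    return [[row[0], row[1], z * row[2] + row[3]] for row in P]


def apply_homography(H: Sequence[Sequence[float]], x: float, y: float) -> Tuple[float, float]:
    """Apply a 3x3 homography to (x, y) and convert back from homogeneous coordinates."""
    u, v, w = (r[0] * x + r[1] * y + r[2] for r in H)
    return (u / w, v / w)


def project_point(P: Sequence[Sequence[float]], X: float, Y: float, Z: float) -> Tuple[float, float]:
    """Project a 3D world point with the full 3x4 projection matrix (reference result)."""
    u, v, w = (r[0] * X + r[1] * Y + r[2] * Z + r[3] for r in P)
    return (u / w, v / w)


# Hypothetical camera: focal length 1000 px, principal point (960, 540),
# mounted 10 m above the world origin and looking straight down.
P = [
    [1000.0, 0.0, -960.0, 9600.0],
    [0.0, -1000.0, -540.0, 5400.0],
    [0.0, 0.0, -1.0, 10.0],
]

# Hypothetical layer heights in metres; the abstract does not state the values used.
LAYER_HEIGHTS_M = [0.0, 0.6, 1.2, 1.8]

for z in LAYER_HEIGHTS_M:
    H = plane_homography(P, z)
    # The same ground location (1 m, 2 m) lands on a different pixel for each layer.
    print(z, apply_homography(H, 1.0, 2.0))
```

In a detection pipeline the homography would typically be used to resample a whole feature map onto a ground-plane grid rather than to map single points; the point mapping above is only the geometric core of that step.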

Results

Task	Dataset	Metric	Value	Model
Object Detection	Wildtrack	MODA	93.5	3DROM
Object Detection	Wildtrack	MODP	75.9	3DROM
Object Detection	Wildtrack	Recall	96.2	3DROM
Object Detection	CityStreet	F1_score (2m)	79.2	3DROM
Object Detection	CityStreet	MODA (2m)	60	3DROM
Object Detection	CityStreet	MODP (2m)	70.1	3DROM
Object Detection	CityStreet	Precision (2m)	82.5	3DROM
Object Detection	CityStreet	Recall (2m)	76.2	3DROM
Object Detection	CVCS	F1_score (1m)	55.1	3DROM
Object Detection	CVCS	MODA (1m)	33.9	3DROM
Object Detection	CVCS	MODP (1m)	73.9	3DROM
Object Detection	CVCS	Precision (1m)	79.5	3DROM
Object Detection	CVCS	Recall (1m)	42.2	3DROM
Object Detection	MultiviewX	MODA	90	3DROM
Object Detection	MultiviewX	MODP	83.7	3DROM
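As a quick consistency check on the table, the F1 score is the harmonic mean of precision and recall. The sketch below recomputes it from the reported CityStreet and CVCS precision and recall values; the helper name `f1_score` is an illustrative choice, and the check assumes the table's F1 values were derived from the listed precision and recall in the usual way.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units in, same units out)."""
    return 2.0 * precision * recall / (precision + recall)


# Precision and recall (in percent) as reported in the results table.
reported = {
    "CityStreet (2m)": (82.5, 76.2),
    "CVCS (1m)": (79.5, 42.2),
}

for dataset, (precision, recall) in reported.items():
    print(dataset, round(f1_score(precision, recall), 1))
```

Rounded to one decimal place, both recomputed values agree with the F1 scores listed in the table (79.2 and 55.1).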