M3D-RPN: Monocular 3D Region Proposal Network for Object Detection

Garrick Brazil, Xiaoming Liu

2019-07-13 | ICCV 2019 10

Tasks: 3D Object Detection From Monocular Images, Region Proposal, Monocular 3D Object Detection, Scene Understanding, Autonomous Driving, Vehicle Pose Estimation, 3D Object Detection, Object Detection


Abstract

Understanding the world in 3D is a critical component of urban autonomous driving. Generally, the combination of expensive LiDAR sensors and stereo RGB imaging has been paramount for successful 3D object detection algorithms, whereas monocular image-only methods experience drastically reduced performance. We propose to reduce the gap by reformulating the monocular 3D detection problem as a standalone 3D region proposal network. We leverage the geometric relationship of 2D and 3D perspectives, allowing 3D boxes to utilize well-known and powerful convolutional features generated in the image-space. To help address the strenuous 3D parameter estimations, we further design depth-aware convolutional layers which enable location-specific feature development and, in consequence, improved 3D scene understanding. Compared to prior work in monocular 3D detection, our method consists of only the proposed 3D region proposal network rather than relying on external networks, data, or multiple stages. M3D-RPN is able to significantly improve the performance of both monocular 3D Object Detection and Bird's Eye View tasks within the KITTI urban autonomous driving dataset, while efficiently using a shared multi-class model.
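
The abstract describes depth-aware convolutional layers only as enabling location-specific features. As a rough illustration, the sketch below assumes that "location-specific" means each horizontal band of image rows gets its own kernel weights, on the reasoning that row position in a driving scene correlates with depth. The function name `depth_aware_pointwise_conv`, the 1x1 (pointwise) kernel, and the equal-sized row bins are simplifying assumptions made for this example and are not taken from the page above or from the official code.

```python
def depth_aware_pointwise_conv(feature_map, bin_weights):
    """Apply a different 1x1 kernel to each horizontal band (row bin).

    feature_map: list of H rows, each a list of W pixels,
                 each pixel a list of C_in floats.
    bin_weights: list of B weight matrices, each C_out x C_in (nested lists).
    Returns an H x W x C_out nested list.
    """
    height = len(feature_map)
    num_bins = len(bin_weights)
    if height == 0 or num_bins == 0:
        raise ValueError("feature_map and bin_weights must be non-empty")

    output = []
    for row_idx, row in enumerate(feature_map):
        # Assumption: rows are split into num_bins near-equal bands,
        # and every row in a band shares that band's weights.
        bin_idx = min(row_idx * num_bins // height, num_bins - 1)
        weights = bin_weights[bin_idx]
        out_row = []
        for pixel in row:
            # 1x1 convolution: a per-pixel matrix-vector product.
            out_row.append([
                sum(w * x for w, x in zip(w_row, pixel))
                for w_row in weights
            ])
        output.append(out_row)
    return output


# Toy input: 4 rows, 2 columns, 1 channel, every value 1.0.
features = [[[1.0] for _ in range(2)] for _ in range(4)]
# Two bins: the top band scales by 2, the bottom band by 10.
weights = [[[2.0]], [[10.0]]]
result = depth_aware_pointwise_conv(features, weights)
```

In this toy case the top two rows are scaled by the first bin's weight and the bottom two rows by the second, whereas an ordinary convolution would apply the same weights at every row. A practical layer would use spatial kernels larger than 1x1 and learned weights.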

Results

Task	Dataset	Metric	Value	Model
Pose Estimation	KITTI Cars Hard	Average Orientation Similarity	67.08	M3D-RPN
Object Detection	Rope3D	AP@0.7	16.75	M3D-RPN+(G)
Object Detection	KITTI Cars Moderate	AP Medium	9.71	M3D-RPN
Object Detection	Waymo Open Dataset	3D mAPH Vehicle (Front Camera Only)	0.65	M3D-RPN
