Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Multi-Modal 3D Object Detection by Box Matching

Zhe Liu, Xiaoqing Ye, Zhikang Zou, Xinwei He, Xiao Tan, Errui Ding, Jingdong Wang, Xiang Bai

2023-05-12 · Autonomous Driving · Object Detection · 3D Object Detection
Paper · PDF · Code (official)

Abstract

Multi-modal 3D object detection has received growing attention because the information from different sensors, such as LiDAR and cameras, is complementary. Most fusion methods for 3D detection rely on accurate alignment and calibration between 3D point clouds and RGB images. However, this assumption is not reliable in a real-world self-driving system, as the alignment between modalities is easily affected by asynchronous sensors and disturbed sensor placement. We propose a novel Fusion network by Box Matching (FBMNet) for multi-modal 3D detection, which provides an alternative approach to cross-modal feature alignment: it learns the correspondence at the bounding-box level, removing the dependency on calibration during inference. With the learned assignments between 3D and 2D object proposals, fusion for detection can be effectively performed by combining their ROI features. Extensive experiments on the nuScenes dataset demonstrate that our method is much more stable than existing fusion methods when dealing with challenging cases such as asynchronous sensors, misaligned sensor placement, and degraded camera images. We hope that FBMNet can provide a practical solution for handling these challenging cases safely in real autonomous driving scenarios. Code will be publicly available at https://github.com/happinesslz/FBMNet.
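The abstract's core idea, matching 3D and 2D proposals at the box level and then fusing their ROI features, can be illustrated with a minimal sketch. This is a hypothetical stand-in, not the paper's FBMNet implementation: the greedy similarity-based matcher below substitutes for the learned assignment described in the abstract, and all function names are illustrative assumptions.

```python
def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = sum(x * x for x in a) ** 0.5
    nb = sum(y * y for y in b) ** 0.5
    return dot / (na * nb) if na and nb else 0.0

def match_boxes(feats_3d, feats_2d):
    """Greedily assign each 3D proposal to at most one 2D proposal by
    descending feature similarity (a stand-in for a learned assignment)."""
    pairs = [(cosine_similarity(f3, f2), i, j)
             for i, f3 in enumerate(feats_3d)
             for j, f2 in enumerate(feats_2d)]
    pairs.sort(reverse=True)
    used_3d, used_2d, assignment = set(), set(), {}
    for sim, i, j in pairs:
        if i not in used_3d and j not in used_2d:
            assignment[i] = j
            used_3d.add(i)
            used_2d.add(j)
    return assignment

def fuse_roi_features(feats_3d, feats_2d, assignment):
    """Fuse matched ROI features by element-wise averaging; unmatched
    3D proposals keep their LiDAR-only features."""
    fused = []
    for i, f3 in enumerate(feats_3d):
        if i in assignment:
            f2 = feats_2d[assignment[i]]
            fused.append([(a + b) / 2 for a, b in zip(f3, f2)])
        else:
            fused.append(list(f3))
    return fused
```

Because the matching operates on proposal features rather than on projected geometry, no camera-LiDAR calibration matrix is needed at this step, which is the property the abstract highlights for robustness to sensor misalignment.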

Results

Task                  Dataset   Metric  Value  Model
3D Object Detection   nuScenes  NDS     0.721  FBMNet (Ours)
3D Object Detection   nuScenes  mAP     0.689  FBMNet (Ours)
Object Detection      nuScenes  NDS     0.721  FBMNet (Ours)
Object Detection      nuScenes  mAP     0.689  FBMNet (Ours)

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan: Language-Guided Visual Path Planning with RLVR (2025-07-17)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)