TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multimodal Virtual Point 3D Detection

Multimodal Virtual Point 3D Detection

Tianwei Yin, Xingyi Zhou, Philipp Krähenbühl

2021-11-12NeurIPS 2021 12Autonomous VehiclesAutonomous Driving3D Object Detection
PaperPDFCode(official)

Abstract

Lidar-based sensing drives current autonomous vehicles. Despite rapid progress, current Lidar sensors still lag two decades behind traditional color cameras in terms of resolution and cost. For autonomous driving, this means that large objects close to the sensors are easily visible, but far-away or small objects comprise only one measurement or two. This is an issue, especially when these objects turn out to be driving hazards. On the other hand, these same objects are clearly visible in onboard RGB sensors. In this work, we present an approach to seamlessly fuse RGB sensors into Lidar-based 3D recognition. Our approach takes a set of 2D detections to generate dense 3D virtual points to augment an otherwise sparse 3D point cloud. These virtual points naturally integrate into any standard Lidar-based 3D detectors along with regular Lidar measurements. The resulting multi-modal detector is simple and effective. Experimental results on the large-scale nuScenes dataset show that our framework improves a strong CenterPoint baseline by a significant 6.6 mAP, and outperforms competing fusion approaches. Code and more visualizations are available at https://tianweiy.github.io/mvp/

Results

TaskDatasetMetricValueModel
Object DetectionnuScenesNDS0.71MVP
Object DetectionnuScenesmAAE0.13MVP
Object DetectionnuScenesmAOE0.32MVP
Object DetectionnuScenesmAP0.66MVP
Object DetectionnuScenesmASE0.24MVP
Object DetectionnuScenesmATE0.26MVP
Object DetectionnuScenesmAVE0.31MVP
3DnuScenesNDS0.71MVP
3DnuScenesmAAE0.13MVP
3DnuScenesmAOE0.32MVP
3DnuScenesmAP0.66MVP
3DnuScenesmASE0.24MVP
3DnuScenesmATE0.26MVP
3DnuScenesmAVE0.31MVP
3D Object DetectionnuScenesNDS0.71MVP
3D Object DetectionnuScenesmAAE0.13MVP
3D Object DetectionnuScenesmAOE0.32MVP
3D Object DetectionnuScenesmAP0.66MVP
3D Object DetectionnuScenesmASE0.24MVP
3D Object DetectionnuScenesmATE0.26MVP
3D Object DetectionnuScenesmAVE0.31MVP
2D ClassificationnuScenesNDS0.71MVP
2D ClassificationnuScenesmAAE0.13MVP
2D ClassificationnuScenesmAOE0.32MVP
2D ClassificationnuScenesmAP0.66MVP
2D ClassificationnuScenesmASE0.24MVP
2D ClassificationnuScenesmATE0.26MVP
2D ClassificationnuScenesmAVE0.31MVP
2D Object DetectionnuScenesNDS0.71MVP
2D Object DetectionnuScenesmAAE0.13MVP
2D Object DetectionnuScenesmAOE0.32MVP
2D Object DetectionnuScenesmAP0.66MVP
2D Object DetectionnuScenesmASE0.24MVP
2D Object DetectionnuScenesmATE0.26MVP
2D Object DetectionnuScenesmAVE0.31MVP
16knuScenesNDS0.71MVP
16knuScenesmAAE0.13MVP
16knuScenesmAOE0.32MVP
16knuScenesmAP0.66MVP
16knuScenesmASE0.24MVP
16knuScenesmATE0.26MVP
16knuScenesmAVE0.31MVP

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16