TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object...

PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Shaoshuai Shi, Chaoxu Guo, Li Jiang, Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li

2019-12-31CVPR 2020 6object-detectionRobust 3D Object Detection3D Object DetectionObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)

Abstract

We present a novel and high-performance 3D object detection framework, named PointVoxel-RCNN (PV-RCNN), for accurate 3D object detection from point clouds. Our proposed method deeply integrates both 3D voxel Convolutional Neural Network (CNN) and PointNet-based set abstraction to learn more discriminative point cloud features. It takes advantages of efficient learning and high-quality proposals of the 3D voxel CNN and the flexible receptive fields of the PointNet-based networks. Specifically, the proposed framework summarizes the 3D scene with a 3D voxel CNN into a small set of keypoints via a novel voxel set abstraction module to save follow-up computations and also to encode representative scene features. Given the high-quality 3D proposals generated by the voxel CNN, the RoI-grid pooling is proposed to abstract proposal-specific features from the keypoints to the RoI-grid points via keypoint set abstraction with multiple receptive fields. Compared with conventional pooling operations, the RoI-grid feature points encode much richer context information for accurately estimating object confidences and locations. Extensive experiments on both the KITTI dataset and the Waymo Open dataset show that our proposed PV-RCNN surpasses state-of-the-art 3D detection methods with remarkable margins by using only point clouds. Code is available at https://github.com/open-mmlab/OpenPCDet.

Results

TaskDatasetMetricValueModel
Object Detectionwaymo all_nsAPH/L271.52PV-RCNN
Object Detectionwaymo cyclistAPH/L271.16PV-RCNN
Object Detectionwaymo vehicleAPH/L273.23PV-RCNN
Object Detectionwaymo pedestrianAPH/L270.16PV-RCNN
3Dwaymo all_nsAPH/L271.52PV-RCNN
3Dwaymo cyclistAPH/L271.16PV-RCNN
3Dwaymo vehicleAPH/L273.23PV-RCNN
3Dwaymo pedestrianAPH/L270.16PV-RCNN
Birds Eye View Object DetectionKITTI Cyclists EasyAP82.49PV-RCNN
Birds Eye View Object DetectionKITTI Cars HardAP86.14PV-RCNN
Birds Eye View Object DetectionKITTI Cars EasyAP94.98PV-RCNN
Birds Eye View Object DetectionKITTI Cyclists HardAP62.41PV-RCNN
3D Object Detectionwaymo all_nsAPH/L271.52PV-RCNN
3D Object Detectionwaymo cyclistAPH/L271.16PV-RCNN
3D Object Detectionwaymo vehicleAPH/L273.23PV-RCNN
3D Object Detectionwaymo pedestrianAPH/L270.16PV-RCNN
2D Classificationwaymo all_nsAPH/L271.52PV-RCNN
2D Classificationwaymo cyclistAPH/L271.16PV-RCNN
2D Classificationwaymo vehicleAPH/L273.23PV-RCNN
2D Classificationwaymo pedestrianAPH/L270.16PV-RCNN
2D Object Detectionwaymo all_nsAPH/L271.52PV-RCNN
2D Object Detectionwaymo cyclistAPH/L271.16PV-RCNN
2D Object Detectionwaymo vehicleAPH/L273.23PV-RCNN
2D Object Detectionwaymo pedestrianAPH/L270.16PV-RCNN
16kwaymo all_nsAPH/L271.52PV-RCNN
16kwaymo cyclistAPH/L271.16PV-RCNN
16kwaymo vehicleAPH/L273.23PV-RCNN
16kwaymo pedestrianAPH/L270.16PV-RCNN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07