Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas
Point cloud is an important type of geometric data structure. Due to its irregular format, most researchers transform such data to regular 3D voxel grids or collections of images. This, however, renders data unnecessarily voluminous and causes issues. In this paper, we design a novel type of neural network that directly consumes point clouds and well respects the permutation invariance of points in the input. Our network, named PointNet, provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing. Though simple, PointNet is highly efficient and effective. Empirically, it shows strong performance on par or even better than state of the art. Theoretically, we provide analysis towards understanding of what the network has learnt and why the network is robust with respect to input perturbation and corruption.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | S3DIS Area5 | mAcc | 49 | PointNet |
| Semantic Segmentation | S3DIS | mAcc | 66.2 | PointNet |
| Semantic Segmentation | KITTI-360 | mIoU Category | 30.42 | PointNet |
| Semantic Segmentation | KITTI-360 | miou | 13.07 | PointNet |
| Semantic Segmentation | IntrA | DSC (A) | 49.59 | PointNet |
| Semantic Segmentation | IntrA | DSC (V) | 85 | PointNet |
| Semantic Segmentation | IntrA | IoU (A) | 37.75 | PointNet |
| Semantic Segmentation | IntrA | IoU (V) | 75.23 | PointNet |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 83.7 | PointNet |
| Object Detection | nuScenes | mAAE | 0.5 | PointNet |
| 3D | nuScenes | mAAE | 0.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 63.4 | PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 68.2 | PointNet |
| Shape Representation Of 3D Point Clouds | IntrA | F1 score (5-fold) | 0.684 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Mean Accuracy | 86 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 89.2 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40-C | Error Rate | 0.283 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Overall Accuracy | 35.2 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Standard Deviation | 13.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 51.97 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 12.1 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Overall Accuracy | 46.6 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Standard Deviation | 13.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Overall Accuracy | 57.81 | PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Standard Deviation | 15.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | GFLOPs | 0.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Number of params (M) | 3.5 | PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 68 | PointNet |
| 3D Semantic Segmentation | KITTI-360 | mIoU Category | 30.42 | PointNet |
| 3D Semantic Segmentation | KITTI-360 | miou | 13.07 | PointNet |
| 3D Object Detection | nuScenes | mAAE | 0.5 | PointNet |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 63.4 | PointNet |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 68.2 | PointNet |
| 3D Point Cloud Classification | IntrA | F1 score (5-fold) | 0.684 | PointNet |
| 3D Point Cloud Classification | ModelNet40 | Mean Accuracy | 86 | PointNet |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 89.2 | PointNet |
| 3D Point Cloud Classification | ModelNet40-C | Error Rate | 0.283 | PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Overall Accuracy | 35.2 | PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Standard Deviation | 13.5 | PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 51.97 | PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 12.1 | PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Overall Accuracy | 46.6 | PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Standard Deviation | 13.5 | PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Overall Accuracy | 57.81 | PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Standard Deviation | 15.5 | PointNet |
| 3D Point Cloud Classification | ScanObjectNN | GFLOPs | 0.5 | PointNet |
| 3D Point Cloud Classification | ScanObjectNN | Number of params (M) | 3.5 | PointNet |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 68 | PointNet |
| Point Cloud Classification | PointCloud-C | mean Corruption Error (mCE) | 1.422 | PointNet |
| Point Cloud Segmentation | PointCloud-C | mean Corruption Error (mCE) | 1.178 | PointNet |
| 2D Classification | nuScenes | mAAE | 0.5 | PointNet |
| 2D Object Detection | nuScenes | mAAE | 0.5 | PointNet |
| 10-shot image generation | S3DIS Area5 | mAcc | 49 | PointNet |
| 10-shot image generation | S3DIS | mAcc | 66.2 | PointNet |
| 10-shot image generation | KITTI-360 | mIoU Category | 30.42 | PointNet |
| 10-shot image generation | KITTI-360 | miou | 13.07 | PointNet |
| 10-shot image generation | IntrA | DSC (A) | 49.59 | PointNet |
| 10-shot image generation | IntrA | DSC (V) | 85 | PointNet |
| 10-shot image generation | IntrA | IoU (A) | 37.75 | PointNet |
| 10-shot image generation | IntrA | IoU (V) | 75.23 | PointNet |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 83.7 | PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 63.4 | PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 68.2 | PointNet |
| 3D Point Cloud Reconstruction | IntrA | F1 score (5-fold) | 0.684 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 | Mean Accuracy | 86 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 89.2 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40-C | Error Rate | 0.283 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Overall Accuracy | 35.2 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Standard Deviation | 13.5 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 51.97 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 12.1 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Overall Accuracy | 46.6 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Standard Deviation | 13.5 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Overall Accuracy | 57.81 | PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Standard Deviation | 15.5 | PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN | GFLOPs | 0.5 | PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN | Number of params (M) | 3.5 | PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 68 | PointNet |
| 16k | nuScenes | mAAE | 0.5 | PointNet |