Charles R. Qi, Li Yi, Hao Su, Leonidas J. Guibas
Few prior works study deep learning on point sets. PointNet by Qi et al. is a pioneer in this direction. However, by design PointNet does not capture local structures induced by the metric space points live in, limiting its ability to recognize fine-grained patterns and generalizability to complex scenes. In this work, we introduce a hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set. By exploiting metric space distances, our network is able to learn local features with increasing contextual scales. With further observation that point sets are usually sampled with varying densities, which results in greatly decreased performance for networks trained on uniform densities, we propose novel set learning layers to adaptively combine features from multiple scales. Experiments show that our network called PointNet++ is able to learn deep point set features efficiently and robustly. In particular, results significantly better than state-of-the-art have been obtained on challenging benchmarks of 3D point clouds.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Person Re-Identification | DukeMTMC-reID | Rank-1 | 60.23 | PointNet++ (MSG) [qi2017pointnet++] |
| Person Re-Identification | DukeMTMC-reID | mAP | 39.36 | PointNet++ (MSG) [qi2017pointnet++] |
| Semantic Segmentation | ScanNet | test mIoU | 33.9 | PointNet++ |
| Semantic Segmentation | ScanNet | val mIoU | 53.5 | PointNet++ |
| Semantic Segmentation | Toronto-3D L002 | mIoU | 56.5 | PointNet++ |
| Semantic Segmentation | Toronto-3D L002 | oAcc | 91.2 | PointNet++ |
| Semantic Segmentation | DALES | Overall Accuracy | 95.7 | PointNet++ |
| Semantic Segmentation | DALES | mIoU | 68.3 | PointNet++ |
| Semantic Segmentation | STPLS3D | mIOU | 15.92 | PointNet++ |
| Semantic Segmentation | KITTI-360 | mIoU Category | 58.28 | PointNet++ |
| Semantic Segmentation | KITTI-360 | miou | 35.66 | PointNet++ |
| Semantic Segmentation | IntrA | DSC (A) | 84.64 | PointNet++ |
| Semantic Segmentation | IntrA | DSC (V) | 96.48 | PointNet++ |
| Semantic Segmentation | IntrA | IoU (A) | 76.38 | PointNet++ |
| Semantic Segmentation | IntrA | IoU (V) | 93.42 | PointNet++ |
| Semantic Segmentation | ShapeNet-Part | Class Average IoU | 81.9 | PointNet++ |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 85.1 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 75.4 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-BG (OA) | 82.3 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-ONLY (OA) | 84.3 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 77.9 | PointNet++ |
| Shape Representation Of 3D Point Clouds | IntrA | F1 score (5-fold) | 0.903 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 90.7 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40-C | Error Rate | 0.236 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Overall Accuracy | 18.8 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Standard Deviation | 7 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 38.53 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 16 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Overall Accuracy | 23.05 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Standard Deviation | 7 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Overall Accuracy | 42.39 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Standard Deviation | 14.2 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | GFLOPs | 1.7 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Number of params (M) | 1.5 | PointNet++ |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 77.9 | PointNet++ |
| 3D Semantic Segmentation | DALES | Overall Accuracy | 95.7 | PointNet++ |
| 3D Semantic Segmentation | DALES | mIoU | 68.3 | PointNet++ |
| 3D Semantic Segmentation | STPLS3D | mIOU | 15.92 | PointNet++ |
| 3D Semantic Segmentation | KITTI-360 | mIoU Category | 58.28 | PointNet++ |
| 3D Semantic Segmentation | KITTI-360 | miou | 35.66 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 75.4 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-BG (OA) | 82.3 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-ONLY (OA) | 84.3 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 77.9 | PointNet++ |
| 3D Point Cloud Classification | IntrA | F1 score (5-fold) | 0.903 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 90.7 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40-C | Error Rate | 0.236 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Overall Accuracy | 18.8 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Standard Deviation | 7 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 38.53 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 16 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Overall Accuracy | 23.05 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Standard Deviation | 7 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Overall Accuracy | 42.39 | PointNet++ |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Standard Deviation | 14.2 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | GFLOPs | 1.7 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | Number of params (M) | 1.5 | PointNet++ |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 77.9 | PointNet++ |
| Point Cloud Segmentation | PointCloud-C | mean Corruption Error (mCE) | 1.112 | PointNet++ |
| 10-shot image generation | ScanNet | test mIoU | 33.9 | PointNet++ |
| 10-shot image generation | ScanNet | val mIoU | 53.5 | PointNet++ |
| 10-shot image generation | Toronto-3D L002 | mIoU | 56.5 | PointNet++ |
| 10-shot image generation | Toronto-3D L002 | oAcc | 91.2 | PointNet++ |
| 10-shot image generation | DALES | Overall Accuracy | 95.7 | PointNet++ |
| 10-shot image generation | DALES | mIoU | 68.3 | PointNet++ |
| 10-shot image generation | STPLS3D | mIOU | 15.92 | PointNet++ |
| 10-shot image generation | KITTI-360 | mIoU Category | 58.28 | PointNet++ |
| 10-shot image generation | KITTI-360 | miou | 35.66 | PointNet++ |
| 10-shot image generation | IntrA | DSC (A) | 84.64 | PointNet++ |
| 10-shot image generation | IntrA | DSC (V) | 96.48 | PointNet++ |
| 10-shot image generation | IntrA | IoU (A) | 76.38 | PointNet++ |
| 10-shot image generation | IntrA | IoU (V) | 93.42 | PointNet++ |
| 10-shot image generation | ShapeNet-Part | Class Average IoU | 81.9 | PointNet++ |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 85.1 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 75.4 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-BG (OA) | 82.3 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-ONLY (OA) | 84.3 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 77.9 | PointNet++ |
| 3D Point Cloud Reconstruction | IntrA | F1 score (5-fold) | 0.903 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 90.7 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40-C | Error Rate | 0.236 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Overall Accuracy | 18.8 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Standard Deviation | 7 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 38.53 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 16 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Overall Accuracy | 23.05 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Standard Deviation | 7 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Overall Accuracy | 42.39 | PointNet++ |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Standard Deviation | 14.2 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | GFLOPs | 1.7 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | Number of params (M) | 1.5 | PointNet++ |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy (PB_T50_RS) | 77.9 | PointNet++ |