Yangyan Li, Rui Bu, Mingchao Sun, Wei Wu, Xinhan Di, Baoquan Chen
We present a simple and general framework for feature learning from point clouds. The key to the success of CNNs is the convolution operator that is capable of leveraging spatially-local correlation in data represented densely in grids (e.g. images). However, point clouds are irregular and unordered, thus directly convolving kernels against features associated with the points, will result in desertion of shape information and variance to point ordering. To address these problems, we propose to learn an $\mathcal{X}$-transformation from the input points, to simultaneously promote two causes. The first is the weighting of the input features associated with the points, and the second is the permutation of the points into a latent and potentially canonical order. Element-wise product and sum operations of the typical convolution operator are subsequently applied on the $\mathcal{X}$-transformed features. The proposed method is a generalization of typical CNNs to feature learning from point clouds, thus we call it PointCNN. Experiments show that PointCNN achieves on par or better performance than state-of-the-art methods on multiple challenging benchmark datasets and tasks.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | IntrA | DSC (A) | 81.74 | PointCNN |
| Semantic Segmentation | IntrA | DSC (V) | 96.62 | PointCNN |
| Semantic Segmentation | IntrA | IoU (A) | 74.11 | PointCNN |
| Semantic Segmentation | IntrA | IoU (V) | 93.59 | PointCNN |
| Semantic Segmentation | ShapeNet-Part | Class Average IoU | 84.6 | PointCNN |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 86.14 | PointCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 75.1 | PointCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-BG (OA) | 86.1 | PointCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-ONLY (OA) | 85.5 | PointCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 78.5 | PointCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 65.41 | PointCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 8.9 | PointCNN |
| Instance Segmentation | S3DIS | mAcc | 75.61 | PointCNN |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 75.1 | PointCNN |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-BG (OA) | 86.1 | PointCNN |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-ONLY (OA) | 85.5 | PointCNN |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 78.5 | PointCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 65.41 | PointCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 8.9 | PointCNN |
| 10-shot image generation | IntrA | DSC (A) | 81.74 | PointCNN |
| 10-shot image generation | IntrA | DSC (V) | 96.62 | PointCNN |
| 10-shot image generation | IntrA | IoU (A) | 74.11 | PointCNN |
| 10-shot image generation | IntrA | IoU (V) | 93.59 | PointCNN |
| 10-shot image generation | ShapeNet-Part | Class Average IoU | 84.6 | PointCNN |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 86.14 | PointCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 75.1 | PointCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-BG (OA) | 86.1 | PointCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-ONLY (OA) | 85.5 | PointCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 78.5 | PointCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 65.41 | PointCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 8.9 | PointCNN |
| 3D Instance Segmentation | S3DIS | mAcc | 75.61 | PointCNN |