Hugues Thomas, Charles R. Qi, Jean-Emmanuel Deschaud, Beatriz Marcotegui, François Goulette, Leonidas J. Guibas
We present Kernel Point Convolution (KPConv), a new design of point convolution, i.e. that operates on point clouds without any intermediate representation. The convolution weights of KPConv are located in Euclidean space by kernel points, and applied to the input points close to them. Its capacity to use any number of kernel points gives KPConv more flexibility than fixed grid convolutions. Furthermore, these locations are continuous in space and can be learned by the network. Therefore, KPConv can be extended to deformable convolutions that learn to adapt kernel points to local geometry. Thanks to a regular subsampling strategy, KPConv is also efficient and robust to varying densities. Whether they use deformable KPConv for complex tasks, or rigid KPconv for simpler tasks, our networks outperform state-of-the-art classification and segmentation approaches on several datasets. We also offer ablation studies and visualizations to provide understanding of what has been learned by KPConv and to validate the descriptive power of deformable KPConv.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ScanNet | test mIoU | 68 | KpConv |
| Semantic Segmentation | ScanNet | val mIoU | 69.2 | KpConv |
| Semantic Segmentation | S3DIS Area5 | mAcc | 72.8 | KPConv |
| Semantic Segmentation | S3DIS Area5 | mIoU | 67.1 | KPConv |
| Semantic Segmentation | S3DIS | mAcc | 79.1 | KPConv |
| Semantic Segmentation | DALES | Overall Accuracy | 97.8 | KPConv |
| Semantic Segmentation | DALES | mIoU | 81.1 | KPConv |
| Semantic Segmentation | STPLS3D | mIOU | 53.73 | KpConv |
| Semantic Segmentation | SensatUrban | mIoU | 57.58 | KPConv |
| Semantic Segmentation | ScanNet | 3DIoU | 68.6 | KPConv |
| Semantic Segmentation | ShapeNet-Part | Class Average IoU | 85.1 | KPConv |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 86.4 | KPConv |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 92.9 | KPConv |
| 3D Semantic Segmentation | DALES | Overall Accuracy | 97.8 | KPConv |
| 3D Semantic Segmentation | DALES | mIoU | 81.1 | KPConv |
| 3D Semantic Segmentation | STPLS3D | mIOU | 53.73 | KpConv |
| 3D Semantic Segmentation | SensatUrban | mIoU | 57.58 | KPConv |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 92.9 | KPConv |
| LIDAR Semantic Segmentation | Paris-Lille-3D | mIOU | 0.759 | KPConv deform |
| Scene Segmentation | ScanNet | 3DIoU | 68.6 | KPConv |
| 10-shot image generation | ScanNet | test mIoU | 68 | KpConv |
| 10-shot image generation | ScanNet | val mIoU | 69.2 | KpConv |
| 10-shot image generation | S3DIS Area5 | mAcc | 72.8 | KPConv |
| 10-shot image generation | S3DIS Area5 | mIoU | 67.1 | KPConv |
| 10-shot image generation | S3DIS | mAcc | 79.1 | KPConv |
| 10-shot image generation | DALES | Overall Accuracy | 97.8 | KPConv |
| 10-shot image generation | DALES | mIoU | 81.1 | KPConv |
| 10-shot image generation | STPLS3D | mIOU | 53.73 | KpConv |
| 10-shot image generation | SensatUrban | mIoU | 57.58 | KPConv |
| 10-shot image generation | ScanNet | 3DIoU | 68.6 | KPConv |
| 10-shot image generation | ShapeNet-Part | Class Average IoU | 85.1 | KPConv |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 86.4 | KPConv |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 92.9 | KPConv |