Hugues Thomas, Yao-Hung Hubert Tsai, Timothy D. Barfoot, Jian Zhang
In the field of deep point cloud understanding, KPConv is a unique architecture that uses kernel points to locate convolutional weights in space, instead of relying on Multi-Layer Perceptron (MLP) encodings. While it initially achieved success, it has since been surpassed by recent MLP networks that employ updated designs and training strategies. Building upon the kernel point principle, we present two novel designs: KPConvD (depthwise KPConv), a lighter design that enables the use of deeper architectures, and KPConvX, an innovative design that scales the depthwise convolutional weights of KPConvD with kernel attention values. Using KPConvX with a modern architecture and training strategy, we are able to outperform current state-of-the-art approaches on the ScanObjectNN, Scannetv2, and S3DIS datasets. We validate our design choices through ablation studies and release our code and models.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ScanNet | val mIoU | 76.3 | KPConvX-L |
| Semantic Segmentation | S3DIS Area5 | mAcc | 78.7 | KPConvX-L |
| Semantic Segmentation | S3DIS Area5 | mIoU | 73.5 | KPConvX-L |
| Semantic Segmentation | S3DIS Area5 | oAcc | 91.7 | KPConvX-L |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 88.1 | KPConvX-L |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 89.3 | KPConvX-L |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 88.1 | KPConvX-L |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 89.3 | KPConvX-L |
| 10-shot image generation | ScanNet | val mIoU | 76.3 | KPConvX-L |
| 10-shot image generation | S3DIS Area5 | mAcc | 78.7 | KPConvX-L |
| 10-shot image generation | S3DIS Area5 | mIoU | 73.5 | KPConvX-L |
| 10-shot image generation | S3DIS Area5 | oAcc | 91.7 | KPConvX-L |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 88.1 | KPConvX-L |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 89.3 | KPConvX-L |