Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, Yu Qiao
Deep neural networks have enjoyed remarkable success for various vision tasks, however it remains challenging to apply CNNs to domains lacking a regular underlying structures such as 3D point clouds. Towards this we propose a novel convolutional architecture, termed SpiderCNN, to efficiently extract geometric features from point clouds. SpiderCNN is comprised of units called SpiderConv, which extend convolutional operations from regular grids to irregular point sets that can be embedded in R^n, by parametrizing a family of convolutional filters. We design the filter as a product of a simple step function that captures local geodesic information and a Taylor polynomial that ensures the expressiveness. SpiderCNN inherits the multi-scale hierarchical architecture from classical CNNs, which allows it to extract semantic deep features. Experiments on ModelNet40 demonstrate that SpiderCNN achieves state-of-the-art accuracy 92.4% on standard benchmarks, and shows competitive performance on segmentation task.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | IntrA | DSC (A) | 75.82 | SpiderCNN |
| Semantic Segmentation | IntrA | DSC (V) | 94.53 | SpiderCNN |
| Semantic Segmentation | IntrA | IoU (A) | 67.25 | SpiderCNN |
| Semantic Segmentation | IntrA | IoU (V) | 90.16 | SpiderCNN |
| Semantic Segmentation | ShapeNet-Part | Class Average IoU | 82.4 | SpiderCNN |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 85.3 | SpiderCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 69.8 | SpiderCNN |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 73.7 | SpiderCNN |
| Shape Representation Of 3D Point Clouds | IntrA | F1 score (5-fold) | 0.872 | SpiderCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 92.4 | SpiderCNN |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 69.8 | SpiderCNN |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 73.7 | SpiderCNN |
| 3D Point Cloud Classification | IntrA | F1 score (5-fold) | 0.872 | SpiderCNN |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 92.4 | SpiderCNN |
| 10-shot image generation | IntrA | DSC (A) | 75.82 | SpiderCNN |
| 10-shot image generation | IntrA | DSC (V) | 94.53 | SpiderCNN |
| 10-shot image generation | IntrA | IoU (A) | 67.25 | SpiderCNN |
| 10-shot image generation | IntrA | IoU (V) | 90.16 | SpiderCNN |
| 10-shot image generation | ShapeNet-Part | Class Average IoU | 82.4 | SpiderCNN |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 85.3 | SpiderCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 69.8 | SpiderCNN |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 73.7 | SpiderCNN |
| 3D Point Cloud Reconstruction | IntrA | F1 score (5-fold) | 0.872 | SpiderCNN |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 92.4 | SpiderCNN |