Kangcheng Liu, Zhi Gao, Feng Lin, Ben M. Chen
This work presents FG-Net, a general deep learning framework for large-scale point clouds understanding without voxelizations, which achieves accurate and real-time performance with a single NVIDIA GTX 1080 GPU. First, a novel noise and outlier filtering method is designed to facilitate subsequent high-level tasks. For effective understanding purpose, we propose a deep convolutional neural network leveraging correlated feature mining and deformable convolution based geometric-aware modelling, in which the local feature relationships and geometric patterns can be fully exploited. For the efficiency issue, we put forward an inverse density sampling operation and a feature pyramid based residual learning strategy to save the computational cost and memory consumption respectively. Extensive experiments on real-world challenging datasets demonstrated that our approaches outperform state-of-the-art approaches in terms of accuracy and efficiency. Moreover, weakly supervised transfer learning is also conducted to demonstrate the generalization capacity of our method.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ScanNet | test mIoU | 69 | FG-Net |
| Semantic Segmentation | Semantic3D | oAcc | 93.6 | Feature Geometric Net |
| Semantic Segmentation | S3DIS | Mean IoU | 70.8 | Feature Geometric Net (FG-Net) |
| Semantic Segmentation | S3DIS | mAcc | 82.9 | Feature Geometric Net (FG-Net) |
| Semantic Segmentation | S3DIS | oAcc | 88.2 | Feature Geometric Net (FG-Net) |
| Semantic Segmentation | PartNet | mIOU | 58.2 | FG-Net |
| Semantic Segmentation | ShapeNet-Part | Class Average IoU | 87.7 | Feature Geometric Net (FG-Net) |
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 86.6 | Feature Geometric Net (FG-Net) |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Mean Accuracy | 91.1 | Feature Geometric Net (FG-Net) |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 93.8 | Feature Geometric Net (FG-Net) |
| 3D Semantic Segmentation | PartNet | mIOU | 58.2 | FG-Net |
| 3D Point Cloud Classification | ModelNet40 | Mean Accuracy | 91.1 | Feature Geometric Net (FG-Net) |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 93.8 | Feature Geometric Net (FG-Net) |
| LIDAR Semantic Segmentation | Paris-Lille-3D | mIOU | 0.819 | Feature Geometric Net (FG Net) |
| 10-shot image generation | ScanNet | test mIoU | 69 | FG-Net |
| 10-shot image generation | Semantic3D | oAcc | 93.6 | Feature Geometric Net |
| 10-shot image generation | S3DIS | Mean IoU | 70.8 | Feature Geometric Net (FG-Net) |
| 10-shot image generation | S3DIS | mAcc | 82.9 | Feature Geometric Net (FG-Net) |
| 10-shot image generation | S3DIS | oAcc | 88.2 | Feature Geometric Net (FG-Net) |
| 10-shot image generation | PartNet | mIOU | 58.2 | FG-Net |
| 10-shot image generation | ShapeNet-Part | Class Average IoU | 87.7 | Feature Geometric Net (FG-Net) |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 86.6 | Feature Geometric Net (FG-Net) |
| 3D Point Cloud Reconstruction | ModelNet40 | Mean Accuracy | 91.1 | Feature Geometric Net (FG-Net) |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 93.8 | Feature Geometric Net (FG-Net) |