Meng-Hao Guo, Jun-Xiong Cai, Zheng-Ning Liu, Tai-Jiang Mu, Ralph R. Martin, Shi-Min Hu
The irregular domain and lack of ordering make it challenging to design deep neural networks for point cloud processing. This paper presents a novel framework named Point Cloud Transformer(PCT) for point cloud learning. PCT is based on Transformer, which achieves huge success in natural language processing and displays great potential in image processing. It is inherently permutation invariant for processing a sequence of points, making it well-suited for point cloud learning. To better capture local context within the point cloud, we enhance input embedding with the support of farthest point sampling and nearest neighbor search. Extensive experiments demonstrate that the PCT achieves the state-of-the-art performance on shape classification, part segmentation and normal estimation tasks.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ShapeNet-Part | Instance Average IoU | 86.4 | Point Cloud Transformer |
| Shape Representation Of 3D Point Clouds | IntrA | F1 score (5-fold) | 0.914 | PCT |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 93.2 | Point Cloud Transformer |
| Shape Representation Of 3D Point Clouds | ModelNet40-C | Error Rate | 0.255 | PCT |
| 3D Point Cloud Classification | IntrA | F1 score (5-fold) | 0.914 | PCT |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 93.2 | Point Cloud Transformer |
| 3D Point Cloud Classification | ModelNet40-C | Error Rate | 0.255 | PCT |
| Point Cloud Classification | PointCloud-C | mean Corruption Error (mCE) | 0.925 | PCT |
| 10-shot image generation | ShapeNet-Part | Instance Average IoU | 86.4 | Point Cloud Transformer |
| 3D Point Cloud Reconstruction | IntrA | F1 score (5-fold) | 0.914 | PCT |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 93.2 | Point Cloud Transformer |
| 3D Point Cloud Reconstruction | ModelNet40-C | Error Rate | 0.255 | PCT |