Yahui Liu, Bin Tian, Yisheng Lv, Lingxi Li, FeiYue Wang
Recently, there have been some attempts of Transformer in 3D point cloud classification. In order to reduce computations, most existing methods focus on local spatial attention, but ignore their content and fail to establish relationships between distant but relevant points. To overcome the limitation of local spatial attention, we propose a point content-based Transformer architecture, called PointConT for short. It exploits the locality of points in the feature space (content-based), which clusters the sampled points with similar features into the same class and computes the self-attention within each class, thus enabling an effective trade-off between capturing long-range dependencies and computational complexity. We further introduce an Inception feature aggregator for point cloud classification, which uses parallel structures to aggregate high-frequency and low-frequency information in each branch separately. Extensive experiments show that our PointConT model achieves a remarkable performance on point cloud shape classification. Especially, our method exhibits 90.3% Top-1 accuracy on the hardest setting of ScanObjectNN. Source code of this paper is available at https://github.com/yahuiliu99/PointConT.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 88.5 | PointConT |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 90.3 | PointConT |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Mean Accuracy | 86 | PointConT (no voting) |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 88 | PointConT (no voting) |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 93.5 | PointConT |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 88.5 | PointConT |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 90.3 | PointConT |
| 3D Point Cloud Classification | ScanObjectNN | Mean Accuracy | 86 | PointConT (no voting) |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 88 | PointConT (no voting) |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 93.5 | PointConT |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 88.5 | PointConT |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 90.3 | PointConT |
| 3D Point Cloud Reconstruction | ScanObjectNN | Mean Accuracy | 86 | PointConT (no voting) |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 88 | PointConT (no voting) |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 93.5 | PointConT |