Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, Matthew J. Kusner
We describe a simple pre-training approach for point clouds. It works in three steps: 1. Mask all points occluded in a camera view; 2. Learn an encoder-decoder model to reconstruct the occluded points; 3. Use the encoder weights as initialisation for downstream point cloud tasks. We find that even when we construct a single pre-training dataset (from ModelNet40), this pre-training method improves accuracy across different datasets and encoders, on a wide range of downstream tasks. Specifically, we show that our method outperforms previous pre-training methods in object classification, and both part-based and semantic segmentation tasks. We study the pre-trained features and find that they lead to wide downstream minima, have high transformation invariance, and have activations that are highly correlated with part labels. Code and data are available at: https://github.com/hansen7/OcCo
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Standard Deviation | 1.5 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Overall Accuracy | 86.5 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Standard Deviation | 2.2 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 90.6 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 2.8 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 1.9 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ScanObjectNN 10-way (10-shot) | Overall Accuracy | 57 | OcCo |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Overall Accuracy | 83.9 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Standard Deviation | 1.8 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Overall Accuracy | 82.9 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Standard Deviation | 1.3 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.5 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Standard Deviation | 1.9 | OcCo+DGCNN |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.4 | OcCo+PointNet |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Standard Deviation | 1.6 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Standard Deviation | 1.5 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Overall Accuracy | 86.5 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Standard Deviation | 2.2 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 90.6 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 2.8 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 1.9 | OcCo+PointNet |
| 3D Point Cloud Classification | ScanObjectNN 10-way (10-shot) | Overall Accuracy | 57 | OcCo |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Overall Accuracy | 83.9 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Standard Deviation | 1.8 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Overall Accuracy | 82.9 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Standard Deviation | 1.3 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.5 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Standard Deviation | 1.9 | OcCo+DGCNN |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.4 | OcCo+PointNet |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Standard Deviation | 1.6 | OcCo+PointNet |
| Point Cloud Classification | PointCloud-C | mean Corruption Error (mCE) | 1.047 | OcCo-DGCNN |
| Point Cloud Segmentation | PointCloud-C | mean Corruption Error (mCE) | 0.977 | OcCo-DGCNN |
| Point Cloud Segmentation | PointCloud-C | mean Corruption Error (mCE) | 1.13 | OcCo-PointNet |
| Point Cloud Segmentation | PointCloud-C | mean Corruption Error (mCE) | 1.173 | OcCo-PCN |
| 3D Point Cloud Linear Classification | ModelNet40 | Overall Accuracy | 89.2 | OcCo |
| 3D Point Cloud Linear Classification | ScanObjectNN | Overall Accuracy | 78.3 | OcCo |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Standard Deviation | 1.5 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Overall Accuracy | 86.5 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Standard Deviation | 2.2 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 90.6 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 2.8 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 89.7 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 1.9 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ScanObjectNN 10-way (10-shot) | Overall Accuracy | 57 | OcCo |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Overall Accuracy | 83.9 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Standard Deviation | 1.8 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Overall Accuracy | 82.9 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Standard Deviation | 1.3 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.5 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Standard Deviation | 1.9 | OcCo+DGCNN |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Overall Accuracy | 92.4 | OcCo+PointNet |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Standard Deviation | 1.6 | OcCo+PointNet |