Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders

Xiangdong Zhang, Shaofeng Zhang, Junchi Yan

Published: 2024-08-16
Tasks: Learning Semantic Representations · Few-Shot Learning · Self-Supervised Learning · Few-Shot 3D Point Cloud Classification · 3D Object Classification · 3D Point Cloud Classification

Paper · PDF · Code (official)

Abstract

Masked autoencoders have been widely explored in point cloud self-supervised learning, where the point cloud is generally divided into visible and masked parts. These methods typically include an encoder that accepts visible patches (normalized) and the corresponding patch centers (positions) as input, and a decoder that accepts the encoder's output together with the centers (positions) of the masked parts to reconstruct each point in the masked patches. The pre-trained encoder is then used for downstream tasks. In this paper, we show a motivating empirical result: when the centers of masked patches are fed directly to the decoder without any information from the encoder, the decoder still reconstructs well. In other words, the patch centers are important, and the reconstruction objective does not necessarily rely on the encoder's representations, which prevents the encoder from learning semantic representations. Based on this key observation, we propose a simple yet effective method, learning to Predict Centers for Point Masked AutoEncoders (PCP-MAE), which guides the model to predict these significant centers and uses the predicted centers in place of the directly provided ones. Specifically, we propose a Predicting Center Module (PCM) that shares parameters with the original encoder and adds extra cross-attention to predict centers. Our method has high pre-training efficiency compared to the alternatives and achieves substantial improvement over Point-MAE, surpassing it by 5.50% on OBJ-BG, 6.03% on OBJ-ONLY, and 5.17% on PB-T50-RS for 3D object classification on the ScanObjectNN dataset. The code is available at https://github.com/aHapBean/PCP-MAE.
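The core idea in the abstract can be illustrated with a minimal NumPy sketch: instead of handing the decoder the true masked-patch centers, a set of mask queries cross-attends to the encoder's visible-patch tokens and regresses the centers, which are supervised against the ground truth. This is an illustrative sketch only, not the authors' implementation; all names (`cross_attention`, `W_center`, the dimensions) are hypothetical, and the real PCM shares parameters with the encoder and uses full multi-head attention.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys, values):
    # Scaled dot-product cross-attention: mask queries attend to encoder outputs.
    d = queries.shape[-1]
    attn = softmax(queries @ keys.T / np.sqrt(d))
    return attn @ values

rng = np.random.default_rng(0)
D = 32                       # token/feature dimension (hypothetical)
n_vis, n_mask = 48, 16       # visible vs. masked patch counts (hypothetical)

vis_tokens = rng.normal(size=(n_vis, D))      # encoder output for visible patches
mask_queries = rng.normal(size=(n_mask, D))   # learnable queries, one per masked patch
W_center = rng.normal(size=(D, 3)) * 0.1      # head projecting features to xyz centers

# PCM sketch: queries cross-attend to visible tokens, then predict 3D centers.
pcm_features = cross_attention(mask_queries, vis_tokens, vis_tokens)
pred_centers = pcm_features @ W_center        # shape (n_mask, 3)

# Training target: the true masked-patch centers, used only as supervision
# (never fed to the decoder directly, unlike in Point-MAE).
true_centers = rng.normal(size=(n_mask, 3))
center_loss = np.mean((pred_centers - true_centers) ** 2)

print(pred_centers.shape)
```

In this framing, the decoder would receive `pred_centers` as positional information for the masked patches, so good reconstruction requires the encoder's features to carry enough semantics to locate the masked regions.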

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-BG (OA) | 95.52 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | OBJ-ONLY (OA) | 94.32 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ScanObjectNN | Overall Accuracy | 90.35 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 | Overall Accuracy | 94.2 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Overall Accuracy | 95.9 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (20-shot) | Standard Deviation | 2.7 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Overall Accuracy | 97.4 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (10-shot) | Standard Deviation | 2.3 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Overall Accuracy | 93.5 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 10-way (10-shot) | Standard Deviation | 3.7 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Overall Accuracy | 99.1 | PCP-MAE |
| Shape Representation Of 3D Point Clouds | ModelNet40 5-way (20-shot) | Standard Deviation | 0.8 | PCP-MAE |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-BG (OA) | 95.52 | PCP-MAE |
| 3D Point Cloud Classification | ScanObjectNN | OBJ-ONLY (OA) | 94.32 | PCP-MAE |
| 3D Point Cloud Classification | ScanObjectNN | Overall Accuracy | 90.35 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 | Overall Accuracy | 94.2 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Overall Accuracy | 95.9 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 10-way (20-shot) | Standard Deviation | 2.7 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Overall Accuracy | 97.4 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 5-way (10-shot) | Standard Deviation | 2.3 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Overall Accuracy | 93.5 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 10-way (10-shot) | Standard Deviation | 3.7 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Overall Accuracy | 99.1 | PCP-MAE |
| 3D Point Cloud Classification | ModelNet40 5-way (20-shot) | Standard Deviation | 0.8 | PCP-MAE |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-BG (OA) | 95.52 | PCP-MAE |
| 3D Point Cloud Reconstruction | ScanObjectNN | OBJ-ONLY (OA) | 94.32 | PCP-MAE |
| 3D Point Cloud Reconstruction | ScanObjectNN | Overall Accuracy | 90.35 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 | Overall Accuracy | 94.2 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Overall Accuracy | 95.9 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (20-shot) | Standard Deviation | 2.7 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Overall Accuracy | 97.4 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (10-shot) | Standard Deviation | 2.3 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Overall Accuracy | 93.5 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 10-way (10-shot) | Standard Deviation | 3.7 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Overall Accuracy | 99.1 | PCP-MAE |
| 3D Point Cloud Reconstruction | ModelNet40 5-way (20-shot) | Standard Deviation | 0.8 | PCP-MAE |

Related Papers

- GLAD: Generalizable Tuning for Vision-Language Models (2025-07-17)
- A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys (2025-07-17)
- Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder (2025-07-14)
- Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection (2025-07-10)
- An Enhanced Privacy-preserving Federated Few-shot Learning Framework for Respiratory Disease Diagnosis (2025-07-10)
- Few-Shot Learning by Explicit Physics Integration: An Application to Groundwater Heat Transport (2025-07-08)
- Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis (2025-07-08)
- ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation (2025-07-03)