TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Masked Discrimination for Self-Supervised Learning on Poin...

Masked Discrimination for Self-Supervised Learning on Point Clouds

Haotian Liu, Mu Cai, Yong Jae Lee

2022-03-21Binary ClassificationSelf-Supervised LearningFew-Shot 3D Point Cloud Classificationobject-detection3D Shape ClassificationObject Detection
PaperPDFCode(official)Code

Abstract

Masked autoencoding has achieved great success for self-supervised learning in the image and language domains. However, mask based pretraining has yet to show benefits for point cloud understanding, likely due to standard backbones like PointNet being unable to properly handle the training versus testing distribution mismatch introduced by masking during training. In this paper, we bridge this gap by proposing a discriminative mask pretraining Transformer framework, MaskPoint}, for point clouds. Our key idea is to represent the point cloud as discrete occupancy values (1 if part of the point cloud; 0 if not), and perform simple binary classification between masked object points and sampled noise points as the proxy task. In this way, our approach is robust to the point sampling variance in point clouds, and facilitates learning rich representations. We evaluate our pretrained models across several downstream tasks, including 3D shape classification, segmentation, and real-word object detection, and demonstrate state-of-the-art results while achieving a significant pretraining speedup (e.g., 4.1x on ScanNet) compared to the prior state-of-the-art Transformer baseline. Code is available at https://github.com/haotian-liu/MaskPoint.

Results

TaskDatasetMetricValueModel
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Overall Accuracy93.4MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Standard Deviation3.5MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Overall Accuracy95MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Standard Deviation3.7MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Overall Accuracy91.4MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Standard Deviation4MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Overall Accuracy97.2MaskPoint
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Standard Deviation1.7MaskPoint
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Overall Accuracy93.4MaskPoint
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Standard Deviation3.5MaskPoint
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Overall Accuracy95MaskPoint
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Standard Deviation3.7MaskPoint
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Overall Accuracy91.4MaskPoint
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Standard Deviation4MaskPoint
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Overall Accuracy97.2MaskPoint
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Standard Deviation1.7MaskPoint
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Overall Accuracy93.4MaskPoint
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Standard Deviation3.5MaskPoint
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Overall Accuracy95MaskPoint
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Standard Deviation3.7MaskPoint
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Overall Accuracy91.4MaskPoint
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Standard Deviation4MaskPoint
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Overall Accuracy97.2MaskPoint
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Standard Deviation1.7MaskPoint

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14