TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PointGPT: Auto-regressively Generative Pre-training from P...

PointGPT: Auto-regressively Generative Pre-training from Point Clouds

Guangyan Chen, Meiling Wang, Yi Yang, Kai Yu, Li Yuan, Yufeng Yue

2023-05-19NeurIPS 2023 11Few-Shot LearningFew-Shot 3D Point Cloud Classification3D Point Cloud Classification
PaperPDFCode(official)

Abstract

Large language models (LLMs) based on the generative pre-training transformer (GPT) have demonstrated remarkable effectiveness across a diverse range of downstream tasks. Inspired by the advancements of the GPT, we present PointGPT, a novel approach that extends the concept of GPT to point clouds, addressing the challenges associated with disorder properties, low information density, and task gaps. Specifically, a point cloud auto-regressive generation task is proposed to pre-train transformer models. Our method partitions the input point cloud into multiple point patches and arranges them in an ordered sequence based on their spatial proximity. Then, an extractor-generator based transformer decoder, with a dual masking strategy, learns latent representations conditioned on the preceding point patches, aiming to predict the next one in an auto-regressive manner. Our scalable approach allows for learning high-capacity models that generalize well, achieving state-of-the-art performance on various downstream tasks. In particular, our approach achieves classification accuracies of 94.9% on the ModelNet40 dataset and 93.4% on the ScanObjectNN dataset, outperforming all other transformer models. Furthermore, our method also attains new state-of-the-art accuracies on all four few-shot learning benchmarks.

Results

TaskDatasetMetricValueModel
Shape Representation Of 3D Point CloudsScanObjectNNOBJ-BG (OA)97.2PointGPT
Shape Representation Of 3D Point CloudsScanObjectNNOBJ-ONLY (OA)96.6PointGPT
Shape Representation Of 3D Point CloudsScanObjectNNOverall Accuracy93.4PointGPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Overall Accuracy96.1PointGPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Standard Deviation2.8PointGPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Overall Accuracy98PointGPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Standard Deviation1.9PointGPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Overall Accuracy94.3PointGPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Standard Deviation3.3PointGPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Overall Accuracy99PointGPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Standard Deviation1PointGPT
3D Point Cloud ClassificationScanObjectNNOBJ-BG (OA)97.2PointGPT
3D Point Cloud ClassificationScanObjectNNOBJ-ONLY (OA)96.6PointGPT
3D Point Cloud ClassificationScanObjectNNOverall Accuracy93.4PointGPT
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Overall Accuracy96.1PointGPT
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Standard Deviation2.8PointGPT
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Overall Accuracy98PointGPT
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Standard Deviation1.9PointGPT
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Overall Accuracy94.3PointGPT
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Standard Deviation3.3PointGPT
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Overall Accuracy99PointGPT
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Standard Deviation1PointGPT
3D Point Cloud ReconstructionScanObjectNNOBJ-BG (OA)97.2PointGPT
3D Point Cloud ReconstructionScanObjectNNOBJ-ONLY (OA)96.6PointGPT
3D Point Cloud ReconstructionScanObjectNNOverall Accuracy93.4PointGPT
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Overall Accuracy96.1PointGPT
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Standard Deviation2.8PointGPT
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Overall Accuracy98PointGPT
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Standard Deviation1.9PointGPT
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Overall Accuracy94.3PointGPT
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Standard Deviation3.3PointGPT
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Overall Accuracy99PointGPT
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Standard Deviation1PointGPT

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10An Enhanced Privacy-preserving Federated Few-shot Learning Framework for Respiratory Disease Diagnosis2025-07-10Few-Shot Learning by Explicit Physics Integration: An Application to Groundwater Heat Transport2025-07-08ViRefSAM: Visual Reference-Guided Segment Anything Model for Remote Sensing Segmentation2025-07-03Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning2025-06-26Dynamic Context-Aware Prompt Recommendation for Domain-Specific AI Applications2025-06-25Ancient Script Image Recognition and Processing: A Review2025-06-24