TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PointNet: Deep Learning on Point Sets for 3D Classificatio...

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation

Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas

2016-12-02CVPR 2017 7Semantic ParsingSkeleton Based Action RecognitionScene SegmentationSemantic SegmentationFew-Shot 3D Point Cloud ClassificationSupervised Only 3D Point Cloud ClassificationPoint Cloud Segmentation3D Semantic Segmentation3D Face Reconstruction
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Point cloud is an important type of geometric data structure. Due to its irregular format, most researchers transform such data to regular 3D voxel grids or collections of images. This, however, renders data unnecessarily voluminous and causes issues. In this paper, we design a novel type of neural network that directly consumes point clouds and well respects the permutation invariance of points in the input. Our network, named PointNet, provides a unified architecture for applications ranging from object classification, part segmentation, to scene semantic parsing. Though simple, PointNet is highly efficient and effective. Empirically, it shows strong performance on par or even better than state of the art. Theoretically, we provide analysis towards understanding of what the network has learnt and why the network is robust with respect to input perturbation and corruption.

Results

TaskDatasetMetricValueModel
Semantic SegmentationS3DIS Area5mAcc49PointNet
Semantic SegmentationS3DISmAcc66.2PointNet
Semantic SegmentationKITTI-360mIoU Category30.42PointNet
Semantic SegmentationKITTI-360miou13.07PointNet
Semantic SegmentationIntrADSC (A)49.59PointNet
Semantic SegmentationIntrADSC (V)85PointNet
Semantic SegmentationIntrAIoU (A)37.75PointNet
Semantic SegmentationIntrAIoU (V)75.23PointNet
Semantic SegmentationShapeNet-PartInstance Average IoU83.7PointNet
Object DetectionnuScenesmAAE0.5PointNet
3DnuScenesmAAE0.5PointNet
Shape Representation Of 3D Point CloudsScanObjectNNMean Accuracy63.4PointNet
Shape Representation Of 3D Point CloudsScanObjectNNOverall Accuracy68.2PointNet
Shape Representation Of 3D Point CloudsIntrAF1 score (5-fold)0.684PointNet
Shape Representation Of 3D Point CloudsModelNet40Mean Accuracy86PointNet
Shape Representation Of 3D Point CloudsModelNet40Overall Accuracy89.2PointNet
Shape Representation Of 3D Point CloudsModelNet40-CError Rate0.283PointNet
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Overall Accuracy35.2PointNet
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Standard Deviation13.5PointNet
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Overall Accuracy51.97PointNet
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Standard Deviation12.1PointNet
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Overall Accuracy46.6PointNet
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Standard Deviation13.5PointNet
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Overall Accuracy57.81PointNet
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Standard Deviation15.5PointNet
Shape Representation Of 3D Point CloudsScanObjectNNGFLOPs0.5PointNet
Shape Representation Of 3D Point CloudsScanObjectNNNumber of params (M)3.5PointNet
Shape Representation Of 3D Point CloudsScanObjectNNOverall Accuracy (PB_T50_RS)68PointNet
3D Semantic SegmentationKITTI-360mIoU Category30.42PointNet
3D Semantic SegmentationKITTI-360miou13.07PointNet
3D Object DetectionnuScenesmAAE0.5PointNet
3D Point Cloud ClassificationScanObjectNNMean Accuracy63.4PointNet
3D Point Cloud ClassificationScanObjectNNOverall Accuracy68.2PointNet
3D Point Cloud ClassificationIntrAF1 score (5-fold)0.684PointNet
3D Point Cloud ClassificationModelNet40Mean Accuracy86PointNet
3D Point Cloud ClassificationModelNet40Overall Accuracy89.2PointNet
3D Point Cloud ClassificationModelNet40-CError Rate0.283PointNet
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Overall Accuracy35.2PointNet
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Standard Deviation13.5PointNet
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Overall Accuracy51.97PointNet
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Standard Deviation12.1PointNet
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Overall Accuracy46.6PointNet
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Standard Deviation13.5PointNet
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Overall Accuracy57.81PointNet
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Standard Deviation15.5PointNet
3D Point Cloud ClassificationScanObjectNNGFLOPs0.5PointNet
3D Point Cloud ClassificationScanObjectNNNumber of params (M)3.5PointNet
3D Point Cloud ClassificationScanObjectNNOverall Accuracy (PB_T50_RS)68PointNet
Point Cloud ClassificationPointCloud-Cmean Corruption Error (mCE)1.422PointNet
Point Cloud SegmentationPointCloud-Cmean Corruption Error (mCE)1.178PointNet
2D ClassificationnuScenesmAAE0.5PointNet
2D Object DetectionnuScenesmAAE0.5PointNet
10-shot image generationS3DIS Area5mAcc49PointNet
10-shot image generationS3DISmAcc66.2PointNet
10-shot image generationKITTI-360mIoU Category30.42PointNet
10-shot image generationKITTI-360miou13.07PointNet
10-shot image generationIntrADSC (A)49.59PointNet
10-shot image generationIntrADSC (V)85PointNet
10-shot image generationIntrAIoU (A)37.75PointNet
10-shot image generationIntrAIoU (V)75.23PointNet
10-shot image generationShapeNet-PartInstance Average IoU83.7PointNet
3D Point Cloud ReconstructionScanObjectNNMean Accuracy63.4PointNet
3D Point Cloud ReconstructionScanObjectNNOverall Accuracy68.2PointNet
3D Point Cloud ReconstructionIntrAF1 score (5-fold)0.684PointNet
3D Point Cloud ReconstructionModelNet40Mean Accuracy86PointNet
3D Point Cloud ReconstructionModelNet40Overall Accuracy89.2PointNet
3D Point Cloud ReconstructionModelNet40-CError Rate0.283PointNet
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Overall Accuracy35.2PointNet
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Standard Deviation13.5PointNet
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Overall Accuracy51.97PointNet
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Standard Deviation12.1PointNet
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Overall Accuracy46.6PointNet
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Standard Deviation13.5PointNet
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Overall Accuracy57.81PointNet
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Standard Deviation15.5PointNet
3D Point Cloud ReconstructionScanObjectNNGFLOPs0.5PointNet
3D Point Cloud ReconstructionScanObjectNNNumber of params (M)3.5PointNet
3D Point Cloud ReconstructionScanObjectNNOverall Accuracy (PB_T50_RS)68PointNet
16knuScenesmAAE0.5PointNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15