TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Instance-aware Dynamic Prompt Tuning for Pre-trained Point...

Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models

Yaohua Zha, Jinpeng Wang, Tao Dai, Bin Chen, Zhi Wang, Shu-Tao Xia

2023-04-14ICCV 2023 13D Parameter-Efficient Fine-Tuning for ClassificationFew-Shot 3D Point Cloud Classification3D Point Cloud ClassificationVisual Prompt Tuning
PaperPDFCode(official)Code(official)Code

Abstract

Pre-trained point cloud models have found extensive applications in 3D understanding tasks like object classification and part segmentation. However, the prevailing strategy of full fine-tuning in downstream tasks leads to large per-task storage overhead for model parameters, which limits the efficiency when applying large-scale pre-trained models. Inspired by the recent success of visual prompt tuning (VPT), this paper attempts to explore prompt tuning on pre-trained point cloud models, to pursue an elegant balance between performance and parameter efficiency. We find while instance-agnostic static prompting, e.g. VPT, shows some efficacy in downstream transfer, it is vulnerable to the distribution diversity caused by various types of noises in real-world point cloud data. To conquer this limitation, we propose a novel Instance-aware Dynamic Prompt Tuning (IDPT) strategy for pre-trained point cloud models. The essence of IDPT is to develop a dynamic prompt generation module to perceive semantic prior features of each point cloud instance and generate adaptive prompt tokens to enhance the model's robustness. Notably, extensive experiments demonstrate that IDPT outperforms full fine-tuning in most tasks with a mere 7% of the trainable parameters, providing a promising solution to parameter-efficient learning for pre-trained point cloud models. Code is available at \url{https://github.com/zyh16143998882/ICCV23-IDPT}.

Results

TaskDatasetMetricValueModel
Shape Representation Of 3D Point CloudsScanObjectNNOBJ-BG (OA)93.12IDPT
Shape Representation Of 3D Point CloudsScanObjectNNOverall Accuracy88.51IDPT
Shape Representation Of 3D Point CloudsModelNet40Overall Accuracy94.4IDPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (20-shot)Overall Accuracy95.4IDPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (10-shot)Overall Accuracy97.3IDPT
Shape Representation Of 3D Point CloudsModelNet40 10-way (10-shot)Overall Accuracy92.8IDPT
Shape Representation Of 3D Point CloudsModelNet40 5-way (20-shot)Overall Accuracy97.9IDPT
3D Point Cloud ClassificationScanObjectNNOBJ-BG (OA)93.12IDPT
3D Point Cloud ClassificationScanObjectNNOverall Accuracy88.51IDPT
3D Point Cloud ClassificationModelNet40Overall Accuracy94.4IDPT
3D Point Cloud ClassificationModelNet40 10-way (20-shot)Overall Accuracy95.4IDPT
3D Point Cloud ClassificationModelNet40 5-way (10-shot)Overall Accuracy97.3IDPT
3D Point Cloud ClassificationModelNet40 10-way (10-shot)Overall Accuracy92.8IDPT
3D Point Cloud ClassificationModelNet40 5-way (20-shot)Overall Accuracy97.9IDPT
3D Point Cloud ReconstructionScanObjectNNOBJ-BG (OA)93.12IDPT
3D Point Cloud ReconstructionScanObjectNNOverall Accuracy88.51IDPT
3D Point Cloud ReconstructionModelNet40Overall Accuracy94.4IDPT
3D Point Cloud ReconstructionModelNet40 10-way (20-shot)Overall Accuracy95.4IDPT
3D Point Cloud ReconstructionModelNet40 5-way (10-shot)Overall Accuracy97.3IDPT
3D Point Cloud ReconstructionModelNet40 10-way (10-shot)Overall Accuracy92.8IDPT
3D Point Cloud ReconstructionModelNet40 5-way (20-shot)Overall Accuracy97.9IDPT

Related Papers

Prompt Disentanglement via Language Guidance and Representation Alignment for Domain Generalization2025-07-03Attention to Burstiness: Low-Rank Bilinear Prompt Tuning2025-06-28Asymmetric Dual Self-Distillation for 3D Self-Supervised Representation Learning2025-06-26DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers2025-05-29Rethinking Gradient-based Adversarial Attacks on Point Cloud Classification2025-05-28SMART-PC: Skeletal Model Adaptation for Robust Test-Time Training in Point Clouds2025-05-26DG-MVP: 3D Domain Generalization via Multiple Views of Point Clouds for Classification2025-04-16Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning2025-04-02