TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Positional Prompt Tuning for Efficient 3D Representation L...

Positional Prompt Tuning for Efficient 3D Representation Learning

Shaochen Zhang, Zekun Qi, Runpei Dong, Xiuxiu Bai, Xing Wei

2024-08-21Representation Learning3D Parameter-Efficient Fine-Tuning for Classificationparameter-efficient fine-tuning3D Point Cloud ClassificationPoint Cloud Classification
PaperPDFCode(official)

Abstract

Point cloud analysis has achieved significant development and is well-performed in multiple downstream tasks like point cloud classification and segmentation, etc. Being conscious of the simplicity of the position encoding structure in Transformer-based architectures, we attach importance to the position encoding as a high-dimensional part and the patch encoder to offer multi-scale information. Together with the sequential Transformer, the whole module with position encoding comprehensively constructs a multi-scale feature abstraction module that considers both the local parts from the patch and the global parts from center points as position encoding. With only a few parameters, the position embedding module fits the setting of PEFT (Parameter-Efficient Fine-Tuning) tasks pretty well. Thus we unfreeze these parameters as a fine-tuning part. At the same time, we review the existing prompt and adapter tuning methods, proposing a fresh way of prompts and synthesizing them with adapters as dynamic adjustments. Our Proposed method of PEFT tasks, namely PPT, with only 1.05% of parameters for training, gets state-of-the-art results in several mainstream datasets, such as 95.01% accuracy in the ScanObjectNN OBJ_BG dataset. Codes will be released at https://github.com/zsc000722/PPT.

Results

TaskDatasetMetricValueModel
Shape Representation Of 3D Point CloudsScanObjectNNOBJ-BG (OA)95.01ReCon+PPT
Shape Representation Of 3D Point CloudsScanObjectNNOBJ-ONLY (OA)93.28ReCon+PPT
Shape Representation Of 3D Point CloudsScanObjectNNOverall Accuracy89.52ReCon+PPT
Shape Representation Of 3D Point CloudsModelNet40Overall Accuracy93.88PointMAE+PPT
3D Point Cloud ClassificationScanObjectNNOBJ-BG (OA)95.01ReCon+PPT
3D Point Cloud ClassificationScanObjectNNOBJ-ONLY (OA)93.28ReCon+PPT
3D Point Cloud ClassificationScanObjectNNOverall Accuracy89.52ReCon+PPT
3D Point Cloud ClassificationModelNet40Overall Accuracy93.88PointMAE+PPT
3D Point Cloud ReconstructionScanObjectNNOBJ-BG (OA)95.01ReCon+PPT
3D Point Cloud ReconstructionScanObjectNNOBJ-ONLY (OA)93.28ReCon+PPT
3D Point Cloud ReconstructionScanObjectNNOverall Accuracy89.52ReCon+PPT
3D Point Cloud ReconstructionModelNet40Overall Accuracy93.88PointMAE+PPT

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction2025-07-15