Positional Prompt Tuning for Efficient 3D Representation Learning

Shaochen Zhang, Zekun Qi, Runpei Dong, Xiuxiu Bai, Xing Wei

2024-08-21Representation Learning 3D Parameter-Efficient Fine-Tuning for Classification parameter-efficient fine-tuning 3D Point Cloud Classification Point Cloud Classification

Paper PDF Code(official)

Abstract

Point cloud analysis has achieved significant development and is well-performed in multiple downstream tasks like point cloud classification and segmentation, etc. Being conscious of the simplicity of the position encoding structure in Transformer-based architectures, we attach importance to the position encoding as a high-dimensional part and the patch encoder to offer multi-scale information. Together with the sequential Transformer, the whole module with position encoding comprehensively constructs a multi-scale feature abstraction module that considers both the local parts from the patch and the global parts from center points as position encoding. With only a few parameters, the position embedding module fits the setting of PEFT (Parameter-Efficient Fine-Tuning) tasks pretty well. Thus we unfreeze these parameters as a fine-tuning part. At the same time, we review the existing prompt and adapter tuning methods, proposing a fresh way of prompts and synthesizing them with adapters as dynamic adjustments. Our Proposed method of PEFT tasks, namely PPT, with only 1.05% of parameters for training, gets state-of-the-art results in several mainstream datasets, such as 95.01% accuracy in the ScanObjectNN OBJ_BG dataset. Codes will be released at https://github.com/zsc000722/PPT.

Results

Task	Dataset	Metric	Value	Model
Shape Representation Of 3D Point Clouds	ScanObjectNN	OBJ-BG (OA)	95.01	ReCon+PPT
Shape Representation Of 3D Point Clouds	ScanObjectNN	OBJ-ONLY (OA)	93.28	ReCon+PPT
Shape Representation Of 3D Point Clouds	ScanObjectNN	Overall Accuracy	89.52	ReCon+PPT
Shape Representation Of 3D Point Clouds	ModelNet40	Overall Accuracy	93.88	PointMAE+PPT
3D Point Cloud Classification	ScanObjectNN	OBJ-BG (OA)	95.01	ReCon+PPT
3D Point Cloud Classification	ScanObjectNN	OBJ-ONLY (OA)	93.28	ReCon+PPT
3D Point Cloud Classification	ScanObjectNN	Overall Accuracy	89.52	ReCon+PPT
3D Point Cloud Classification	ModelNet40	Overall Accuracy	93.88	PointMAE+PPT
3D Point Cloud Reconstruction	ScanObjectNN	OBJ-BG (OA)	95.01	ReCon+PPT
3D Point Cloud Reconstruction	ScanObjectNN	OBJ-ONLY (OA)	93.28	ReCon+PPT
3D Point Cloud Reconstruction	ScanObjectNN	Overall Accuracy	89.52	ReCon+PPT
3D Point Cloud Reconstruction	ModelNet40	Overall Accuracy	93.88	PointMAE+PPT

Positional Prompt Tuning for Efficient 3D Representation Learning

Abstract

Results

Related Papers

Positional Prompt Tuning for Efficient 3D Representation Learning

Abstract

Results

Related Papers