Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Semantic-visual Guided Transformer for Few-shot Class-incremental Learning

Wenhao Qiu, Sichao Fu, Jingyi Zhang, Chengxiang Lei, Qinmu Peng

2023-03-27 · Few-Shot Class-Incremental Learning · Representation Learning · Class-Incremental Learning · Incremental Learning

Paper · PDF

Abstract

Few-shot class-incremental learning (FSCIL) has recently attracted extensive attention in various areas. Existing FSCIL methods depend heavily on the robustness of a feature backbone pre-trained on the base classes. In recent years, Transformer variants have made significant progress in feature representation learning across many fields; nevertheless, the Transformer has not yet realized the same potential in FSCIL scenarios. In this paper, we develop a semantic-visual guided Transformer (SV-T) to enhance the feature-extraction capacity of the pre-trained feature backbone on incremental classes. Specifically, we first use the visual (image) labels provided by the base classes to supervise the optimization of the Transformer. Then, a text encoder is introduced to automatically generate a corresponding semantic (text) label for each image from the base classes. Finally, the constructed semantic labels are applied to the Transformer to guide its parameter updates. Our SV-T takes full advantage of the richer supervision available from the base classes and further improves the training robustness of the feature backbone. More importantly, SV-T is an independent method that can be applied directly to existing FSCIL architectures to acquire embeddings for various incremental classes. Extensive experiments on three benchmarks, two FSCIL architectures, and two Transformer variants show that the proposed SV-T obtains a significant improvement over existing state-of-the-art FSCIL methods.
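The abstract describes two supervision signals over the same backbone: a visual term driven by base-class image labels and a semantic term driven by text-encoder embeddings of generated text labels. A minimal sketch of how such a combined objective could look, assuming a cosine-distance alignment term and a trade-off weight `lam` (both hypothetical choices, not the authors' implementation):

```python
import math

def softmax(logits):
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

def cross_entropy(logits_batch, labels):
    """Visual supervision: classification loss against base-class image labels."""
    total = 0.0
    for logits, y in zip(logits_batch, labels):
        total += -math.log(softmax(logits)[y] + 1e-12)
    return total / len(labels)

def cosine_distance(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (na * nb)

def semantic_alignment(img_embs, txt_embs):
    """Semantic supervision: pull each image feature toward the text-encoder
    embedding of its generated text label (mean cosine distance here)."""
    return sum(cosine_distance(a, b) for a, b in zip(img_embs, txt_embs)) / len(img_embs)

def sv_t_loss(logits_batch, labels, img_embs, txt_embs, lam=0.5):
    # Combined objective: visual (label) term plus weighted semantic (text) term.
    # `lam` is a hypothetical trade-off weight, not taken from the paper.
    return cross_entropy(logits_batch, labels) + lam * semantic_alignment(img_embs, txt_embs)
```

When the image features already coincide with their text embeddings, the semantic term vanishes and only the visual classification loss remains.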

Results

| Task                        | Dataset       | Metric           | Value | Model |
|-----------------------------|---------------|------------------|-------|-------|
| Continual Learning          | CUB-200-2011  | Average Accuracy | 78.65 | SV-T  |
| Continual Learning          | CUB-200-2011  | Last Accuracy    | 76.17 | SV-T  |
| Continual Learning          | CIFAR-100     | Average Accuracy | 76.84 | SV-T  |
| Continual Learning          | CIFAR-100     | Last Accuracy    | 69.75 | SV-T  |
| Continual Learning          | mini-Imagenet | Average Accuracy | 85.07 | SV-T  |
| Continual Learning          | mini-Imagenet | Last Accuracy    | 81.65 | SV-T  |
| Class Incremental Learning  | CUB-200-2011  | Average Accuracy | 78.65 | SV-T  |
| Class Incremental Learning  | CUB-200-2011  | Last Accuracy    | 76.17 | SV-T  |
| Class Incremental Learning  | CIFAR-100     | Average Accuracy | 76.84 | SV-T  |
| Class Incremental Learning  | CIFAR-100     | Last Accuracy    | 69.75 | SV-T  |
| Class Incremental Learning  | mini-Imagenet | Average Accuracy | 85.07 | SV-T  |
| Class Incremental Learning  | mini-Imagenet | Last Accuracy    | 81.65 | SV-T  |
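The two metrics in the table are the standard FSCIL summary statistics: Average Accuracy is the mean top-1 accuracy over all incremental sessions, while Last Accuracy is the accuracy after the final session. A small sketch (the example session values are illustrative, not the paper's):

```python
def average_accuracy(session_accs):
    """Mean of per-session top-1 accuracies across all incremental sessions."""
    return sum(session_accs) / len(session_accs)

def last_accuracy(session_accs):
    """Top-1 accuracy after the final incremental session."""
    return session_accs[-1]

# Hypothetical per-session accuracies for a 3-session run:
accs = [90.0, 85.0, 80.0]
print(average_accuracy(accs))  # 85.0
print(last_accuracy(accs))     # 80.0
```

Average Accuracy rewards stable performance across the whole incremental sequence, whereas Last Accuracy captures how much the model retains once every session has been learned.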

Related Papers

- Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper (2025-07-20)
- Spectral Bellman Method: Unifying Representation and Exploration in RL (2025-07-17)
- Boosting Team Modeling through Tempo-Relational Representation Learning (2025-07-17)
- Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
- Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization? (2025-07-16)
- Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos (2025-07-16)
- A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction (2025-07-15)
- Dual Dimensions Geometric Representation Learning Based Document Dewarping (2025-07-11)