Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Class-Balanced Distillation for Long-Tailed Visual Recognition

Ahmet Iscen, André Araujo, Boqing Gong, Cordelia Schmid

2021-04-12 · Image Classification · Long-tail Learning · Knowledge Distillation
Paper · PDF · Code (official)

Abstract

Real-world imagery is often characterized by a significant imbalance in the number of images per class, leading to long-tailed distributions. A simple and effective approach to long-tailed visual recognition is to learn feature representations and a classifier separately, with instance sampling and class-balanced sampling, respectively. In this work, we introduce a new framework, motivated by the key observation that a feature representation learned with instance sampling is far from optimal in a long-tailed setting. Our main contribution is a new training method, referred to as Class-Balanced Distillation (CBD), that leverages knowledge distillation to enhance feature representations. CBD allows the feature representation to evolve in the second training stage, guided by the teacher learned in the first stage. The second stage uses class-balanced sampling to focus on under-represented classes. This framework naturally accommodates multiple teachers, unlocking the information in an ensemble of models to enhance recognition capabilities. Our experiments show that the proposed technique consistently outperforms the state of the art on long-tailed recognition benchmarks such as ImageNet-LT, iNaturalist17 and iNaturalist18.
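The two ingredients the abstract combines — class-balanced sampling in the second stage and distillation toward a first-stage teacher — can be sketched in plain Python. This is a minimal illustration, not the paper's implementation: the helper names and the squared-error loss between L2-normalized features are assumptions made for clarity.

```python
from collections import Counter
import math


def class_balanced_weights(labels):
    """Per-sample sampling weights so every class is drawn equally often.

    With weight 1/count(class), a class with 4 images and a class with 1
    image each contribute the same total mass to the sampler (stage two).
    """
    counts = Counter(labels)
    return [1.0 / counts[y] for y in labels]


def feature_distillation_loss(student_feats, teacher_feats):
    """Mean squared distance between L2-normalized feature vectors.

    Illustrative stand-in for the distillation objective: the frozen
    first-stage teacher guides the student's evolving representation.
    """
    def normalize(v):
        n = math.sqrt(sum(x * x for x in v))
        return [x / n for x in v]

    total = 0.0
    for s, t in zip(student_feats, teacher_feats):
        s, t = normalize(s), normalize(t)
        total += sum((a - b) ** 2 for a, b in zip(s, t))
    return total / len(student_feats)


# Long-tailed toy labels: class 0 has four samples, class 1 has one.
w = class_balanced_weights([0, 0, 0, 0, 1])
# w == [0.25, 0.25, 0.25, 0.25, 1.0]: each class gets equal total weight.
```

The multi-teacher variant mentioned in the abstract would simply average this loss over an ensemble of frozen teachers.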

Results

Task                                | Dataset     | Metric         | Value | Model
Image Classification                | ImageNet-LT | Top-1 Accuracy | 57.7  | CBD-ENS (ResNet-152)
Image Classification                | ImageNet-LT | Top-1 Accuracy | 55.6  | CBD-ENS (ResNet-50)
Few-Shot Image Classification       | ImageNet-LT | Top-1 Accuracy | 57.7  | CBD-ENS (ResNet-152)
Few-Shot Image Classification       | ImageNet-LT | Top-1 Accuracy | 55.6  | CBD-ENS (ResNet-50)
Generalized Few-Shot Classification | ImageNet-LT | Top-1 Accuracy | 57.7  | CBD-ENS (ResNet-152)
Generalized Few-Shot Classification | ImageNet-LT | Top-1 Accuracy | 55.6  | CBD-ENS (ResNet-50)
Long-tail Learning                  | ImageNet-LT | Top-1 Accuracy | 57.7  | CBD-ENS (ResNet-152)
Long-tail Learning                  | ImageNet-LT | Top-1 Accuracy | 55.6  | CBD-ENS (ResNet-50)
Generalized Few-Shot Learning       | ImageNet-LT | Top-1 Accuracy | 57.7  | CBD-ENS (ResNet-152)
Generalized Few-Shot Learning       | ImageNet-LT | Top-1 Accuracy | 55.6  | CBD-ENS (ResNet-50)

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment (2025-07-21)
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations (2025-07-18)
Adversarial attacks to image classification systems using evolutionary algorithms (2025-07-17)
Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy (2025-07-17)
Federated Learning for Commercial Image Sources (2025-07-17)
MUPAX: Multidimensional Problem Agnostic eXplainable AI (2025-07-17)
Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces (2025-07-17)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition (2025-07-16)