Balanced Contrastive Learning for Long-Tailed Visual Recognition

Jianggang Zhu, Zheng Wang, Jingjing Chen, Yi-Ping Phoebe Chen, Yu-Gang Jiang

2022-07-19, CVPR 2022
Tasks: Image Classification, Representation Learning, Long-tail Learning, Contrastive Learning, Long-tail Learning on CIFAR-10-LT (ρ=100)


Abstract

Real-world data typically follow a long-tailed distribution, where a few majority categories occupy most of the data while most minority categories contain only a limited number of samples. Classification models that minimize cross-entropy struggle to represent and classify the tail classes. Although the problem of learning unbiased classifiers has been well studied, methods for representing imbalanced data are under-explored. In this paper, we focus on representation learning for imbalanced data. Supervised contrastive learning (SCL) has recently shown promising performance on balanced data. However, through our theoretical analysis, we find that for long-tailed data it fails to form a regular simplex, which is an ideal geometric configuration for representation learning. To correct the optimization behavior of SCL and further improve the performance of long-tailed visual recognition, we propose a novel loss for balanced contrastive learning (BCL). Compared with SCL, BCL makes two improvements: class-averaging, which balances the gradient contribution of negative classes, and class-complement, which allows all classes to appear in every mini-batch. BCL satisfies the condition for forming a regular simplex and assists the optimization of cross-entropy. Equipped with BCL, the proposed two-branch framework obtains a stronger feature representation and achieves competitive performance on long-tailed benchmark datasets such as CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018. Our code is available at https://github.com/FlamieZhu/BCL.
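To make the two ideas in the abstract concrete, here is a minimal pure-Python sketch of a contrastive loss with class-averaging and class-complement. It is one reading of the abstract, not the authors' implementation. The function name `balanced_contrastive_loss`, the `prototypes` argument (one vector per class, standing in for whatever the method uses to represent absent classes), and the temperature value are all assumptions made for illustration.

```python
import math
from collections import defaultdict


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]


def balanced_contrastive_loss(features, labels, prototypes, temperature=0.1):
    """Illustrative loss (assumed form, not the official code).

    features:   list of feature vectors, one per sample in the mini-batch
    labels:     list of integer class ids, same length as features
    prototypes: dict mapping every class id to one vector (hypothetical input)
    """
    missing = set(labels) - set(prototypes)
    if missing:
        raise ValueError(f"no prototype for classes: {sorted(missing)}")

    feats = [normalize(f) for f in features]
    protos = {c: normalize(p) for c, p in prototypes.items()}

    # Class-complement: each class contributes its prototype as an extra
    # entry, so every class is present even if the batch holds no sample of it.
    members = defaultdict(list)  # class id -> list of (batch index or None, vector)
    for c, p in protos.items():
        members[c].append((None, p))
    for idx, (f, y) in enumerate(zip(feats, labels)):
        members[y].append((idx, f))

    total = 0.0
    for i, (z, y) in enumerate(zip(feats, labels)):
        # Class-averaging: average the exp-similarities inside each class
        # before summing over classes, so a head class with many samples
        # does not dominate the denominator.
        denom = 0.0
        for entries in members.values():
            sims = [math.exp(dot(z, v) / temperature)
                    for j, v in entries if j != i]
            denom += sum(sims) / len(sims)

        # Positives: same-class samples (excluding the anchor) plus the prototype.
        positives = [v for j, v in members[y] if j != i]
        log_probs = [dot(z, v) / temperature - math.log(denom) for v in positives]
        total += -sum(log_probs) / len(log_probs)
    return total / len(feats)


# Imbalanced toy batch: five samples of class 0, one of class 1, none of class 2.
features = [[1.0, 0.1], [0.9, 0.2], [1.0, 0.0], [0.8, 0.1], [0.9, 0.0], [0.1, 1.0]]
labels = [0, 0, 0, 0, 0, 1]
prototypes = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [-1.0, -1.0]}
loss = balanced_contrastive_loss(features, labels, prototypes)
print(loss)
```

In this sketch, class 2 never appears in the batch yet still enters every denominator through its prototype, and the five class-0 samples count as one averaged term rather than five separate ones.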

Results

Task	Dataset	Metric	Value	Model
Image Classification	CIFAR-10-LT (ρ=10)	Error Rate	8.9	BCL(ResNet-32)
Image Classification	CIFAR-100-LT (ρ=50)	Error Rate	43.4	BCL(ResNet-32)
Image Classification	ImageNet-LT	Top-1 Accuracy	57.1	BCL(ResNeXt-50)
Image Classification	CIFAR-100-LT (ρ=100)	Error Rate	46.1	BCL(ResNet-32)
Few-Shot Image Classification	CIFAR-10-LT (ρ=10)	Error Rate	8.9	BCL(ResNet-32)
Few-Shot Image Classification	CIFAR-100-LT (ρ=50)	Error Rate	43.4	BCL(ResNet-32)
Few-Shot Image Classification	ImageNet-LT	Top-1 Accuracy	57.1	BCL(ResNeXt-50)
Few-Shot Image Classification	CIFAR-100-LT (ρ=100)	Error Rate	46.1	BCL(ResNet-32)
Generalized Few-Shot Classification	CIFAR-10-LT (ρ=10)	Error Rate	8.9	BCL(ResNet-32)
Generalized Few-Shot Classification	CIFAR-100-LT (ρ=50)	Error Rate	43.4	BCL(ResNet-32)
Generalized Few-Shot Classification	ImageNet-LT	Top-1 Accuracy	57.1	BCL(ResNeXt-50)
Generalized Few-Shot Classification	CIFAR-100-LT (ρ=100)	Error Rate	46.1	BCL(ResNet-32)
Long-tail Learning	CIFAR-10-LT (ρ=10)	Error Rate	8.9	BCL(ResNet-32)
Long-tail Learning	CIFAR-100-LT (ρ=50)	Error Rate	43.4	BCL(ResNet-32)
Long-tail Learning	ImageNet-LT	Top-1 Accuracy	57.1	BCL(ResNeXt-50)
Long-tail Learning	CIFAR-100-LT (ρ=100)	Error Rate	46.1	BCL(ResNet-32)
Generalized Few-Shot Learning	CIFAR-10-LT (ρ=10)	Error Rate	8.9	BCL(ResNet-32)
Generalized Few-Shot Learning	CIFAR-100-LT (ρ=50)	Error Rate	43.4	BCL(ResNet-32)
Generalized Few-Shot Learning	ImageNet-LT	Top-1 Accuracy	57.1	BCL(ResNeXt-50)
Generalized Few-Shot Learning	CIFAR-100-LT (ρ=100)	Error Rate	46.1	BCL(ResNet-32)
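The ρ in the dataset names is usually read as the imbalance ratio, that is, the size of the largest class divided by the size of the smallest. This page does not say how the long-tailed subsets were built, so the sketch below assumes the commonly used exponential profile, in which per-class counts decay geometrically from the head class to the tail class. The function name `long_tailed_counts` is hypothetical; the only external fact used is that the CIFAR-10 training set has 5000 images per class.

```python
def long_tailed_counts(n_max, num_classes, imbalance_ratio):
    """Per-class sample counts under an assumed exponential decay profile.

    Class 0 keeps n_max samples; the last class keeps about
    n_max / imbalance_ratio samples.
    """
    counts = []
    for i in range(num_classes):
        frac = i / (num_classes - 1)  # 0.0 for the head class, 1.0 for the tail class
        counts.append(int(n_max * (1.0 / imbalance_ratio) ** frac))
    return counts


# CIFAR-10 has 5000 training images per class; rho = 100 is assumed here.
counts = long_tailed_counts(n_max=5000, num_classes=10, imbalance_ratio=100)
print(counts)
```

Under this assumption, a larger ρ means a steeper drop from head to tail, which is why error rates at ρ=100 are not directly comparable with those at ρ=10 or ρ=50.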
