TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Global and Local Mixture Consistency Cumulative Learning f...

Global and Local Mixture Consistency Cumulative Learning for Long-tailed Visual Recognitions

Fei Du, Peng Yang, Qi Jia, Fengtao Nan, Xiaoting Chen, Yun Yang

2023-05-15CVPR 2023 1Long-tail Learning
PaperPDFCodeCode(official)

Abstract

In this paper, our goal is to design a simple learning paradigm for long-tail visual recognition, which not only improves the robustness of the feature extractor but also alleviates the bias of the classifier towards head classes while reducing the training skills and overhead. We propose an efficient one-stage training strategy for long-tailed visual recognition called Global and Local Mixture Consistency cumulative learning (GLMC). Our core ideas are twofold: (1) a global and local mixture consistency loss improves the robustness of the feature extractor. Specifically, we generate two augmented batches by the global MixUp and local CutMix from the same batch data, respectively, and then use cosine similarity to minimize the difference. (2) A cumulative head tail soft label reweighted loss mitigates the head class bias problem. We use empirical class frequencies to reweight the mixed label of the head-tail class for long-tailed data and then balance the conventional loss and the rebalanced loss with a coefficient accumulated by epochs. Our approach achieves state-of-the-art accuracy on CIFAR10-LT, CIFAR100-LT, and ImageNet-LT datasets. Additional experiments on balanced ImageNet and CIFAR demonstrate that GLMC can significantly improve the generalization of backbones. Code is made publicly available at https://github.com/ynu-yangpeng/GLMC.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate5GLMC+MaxNorm (ResNet-34, channel x4)
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate5.15GLMC (ResNet-34, channel x4)
Image ClassificationCIFAR-100-LT (ρ=50)Error Rate36.15GLMC (ResNet-34, channel x4)
Image ClassificationCIFAR-100-LT (ρ=10)Error Rate25.72GLMC+MaxNorm (ResNet-32, channel x4)
Image ClassificationCIFAR-100-LT (ρ=10)Error Rate26.53GLMC (ResNet-34, channel x4)
Image ClassificationImageNet-LTTop-1 Accuracy56.3GLMC (ResNeXt-50)
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate41.59GLMC+MaxNorm (ResNet-34, channel x4)
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate42.01GLMC (ResNet-34, channel x4)
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate10.42GLMC+MaxNorm (ResNet-34, channel x4)
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate11.5GLMC (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate5GLMC+MaxNorm (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate5.15GLMC (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=50)Error Rate36.15GLMC (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=10)Error Rate25.72GLMC+MaxNorm (ResNet-32, channel x4)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=10)Error Rate26.53GLMC (ResNet-34, channel x4)
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy56.3GLMC (ResNeXt-50)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate41.59GLMC+MaxNorm (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate42.01GLMC (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate10.42GLMC+MaxNorm (ResNet-34, channel x4)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate11.5GLMC (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate5GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate5.15GLMC (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=50)Error Rate36.15GLMC (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=10)Error Rate25.72GLMC+MaxNorm (ResNet-32, channel x4)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=10)Error Rate26.53GLMC (ResNet-34, channel x4)
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy56.3GLMC (ResNeXt-50)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate41.59GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate42.01GLMC (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate10.42GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate11.5GLMC (ResNet-34, channel x4)
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate5GLMC+MaxNorm (ResNet-34, channel x4)
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate5.15GLMC (ResNet-34, channel x4)
Long-tail LearningCIFAR-100-LT (ρ=50)Error Rate36.15GLMC (ResNet-34, channel x4)
Long-tail LearningCIFAR-100-LT (ρ=10)Error Rate25.72GLMC+MaxNorm (ResNet-32, channel x4)
Long-tail LearningCIFAR-100-LT (ρ=10)Error Rate26.53GLMC (ResNet-34, channel x4)
Long-tail LearningImageNet-LTTop-1 Accuracy56.3GLMC (ResNeXt-50)
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate41.59GLMC+MaxNorm (ResNet-34, channel x4)
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate42.01GLMC (ResNet-34, channel x4)
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate10.42GLMC+MaxNorm (ResNet-34, channel x4)
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate11.5GLMC (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate5GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate5.15GLMC (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=50)Error Rate36.15GLMC (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=10)Error Rate25.72GLMC+MaxNorm (ResNet-32, channel x4)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=10)Error Rate26.53GLMC (ResNet-34, channel x4)
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy56.3GLMC (ResNeXt-50)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate41.59GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate42.01GLMC (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate10.42GLMC+MaxNorm (ResNet-34, channel x4)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate11.5GLMC (ResNet-34, channel x4)

Related Papers

Mitigating Spurious Correlations with Causal Logit Perturbation2025-05-21LIFT+: Lightweight Fine-Tuning for Long-Tail Learning2025-04-17Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition2024-10-28Learning from Neighbors: Category Extrapolation for Long-Tail Learning2024-10-21Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition2024-10-08AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation2024-09-30Representation Norm Amplification for Out-of-Distribution Detection in Long-Tail Learning2024-08-20LTRL: Boosting Long-tail Recognition via Reflective Learning2024-07-17