TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Imbalanced Datasets with Label-Distribution-Aware...

Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss

Kaidi Cao, Colin Wei, Adrien Gaidon, Nikos Arechiga, Tengyu Ma

2019-06-18NeurIPS 2019 12Long-tail LearningLong-tail learning with class descriptors
PaperPDFCodeCodeCode(official)CodeCodeCodeCode

Abstract

Deep learning algorithms can fare poorly when the training dataset suffers from heavy class-imbalance but the testing criterion requires good generalization on less frequent classes. We design two novel methods to improve performance in such scenarios. First, we propose a theoretically-principled label-distribution-aware margin (LDAM) loss motivated by minimizing a margin-based generalization bound. This loss replaces the standard cross-entropy objective during training and can be applied with prior strategies for training with class-imbalance such as re-weighting or re-sampling. Second, we propose a simple, yet effective, training schedule that defers re-weighting until after the initial stage, allowing the model to learn an initial representation while avoiding some of the complications associated with re-weighting or re-sampling. We test our methods on several benchmark vision tasks including the real-world imbalanced dataset iNaturalist 2018. Our experiments show that either of these methods alone can already improve over existing techniques and their combination achieves even better performance gains.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate11.84LDAM-DRW
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate13.21Class-balanced Resampling
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate13.61Empirical Risk Minimization (ERM, CE)
Image ClassificationCIFAR-100-LT (ρ=10)Error Rate41.29LDAM-DRW
Image ClassificationCOCO-MLTAverage mAP40.53LDAM(ResNet-50)
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate57.96LDAM-DRW
Image ClassificationVOC-MLTAverage mAP70.73LDAM(ResNet-50)
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate22.97LDAM-DRW
Image ClassificationCUB-LTLong-Tailed Accuracy64.1LDAM
Image ClassificationCUB-LTPer-Class Accuracy50.1LDAM
Image ClassificationAWA-LTLong-Tailed Accuracy93.5LDAM
Image ClassificationAWA-LTPer-Class Accuracy69.1LDAM
Image ClassificationSUN-LTLong-Tailed Accuracy36.4LDAM
Image ClassificationSUN-LTPer-Class Accuracy29.8LDAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate11.84LDAM-DRW
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate13.21Class-balanced Resampling
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate13.61Empirical Risk Minimization (ERM, CE)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=10)Error Rate41.29LDAM-DRW
Few-Shot Image ClassificationCOCO-MLTAverage mAP40.53LDAM(ResNet-50)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate57.96LDAM-DRW
Few-Shot Image ClassificationVOC-MLTAverage mAP70.73LDAM(ResNet-50)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate22.97LDAM-DRW
Few-Shot Image ClassificationCUB-LTLong-Tailed Accuracy64.1LDAM
Few-Shot Image ClassificationCUB-LTPer-Class Accuracy50.1LDAM
Few-Shot Image ClassificationAWA-LTLong-Tailed Accuracy93.5LDAM
Few-Shot Image ClassificationAWA-LTPer-Class Accuracy69.1LDAM
Few-Shot Image ClassificationSUN-LTLong-Tailed Accuracy36.4LDAM
Few-Shot Image ClassificationSUN-LTPer-Class Accuracy29.8LDAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate11.84LDAM-DRW
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate13.21Class-balanced Resampling
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate13.61Empirical Risk Minimization (ERM, CE)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=10)Error Rate41.29LDAM-DRW
Generalized Few-Shot ClassificationCOCO-MLTAverage mAP40.53LDAM(ResNet-50)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate57.96LDAM-DRW
Generalized Few-Shot ClassificationVOC-MLTAverage mAP70.73LDAM(ResNet-50)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate22.97LDAM-DRW
Generalized Few-Shot ClassificationCUB-LTLong-Tailed Accuracy64.1LDAM
Generalized Few-Shot ClassificationCUB-LTPer-Class Accuracy50.1LDAM
Generalized Few-Shot ClassificationAWA-LTLong-Tailed Accuracy93.5LDAM
Generalized Few-Shot ClassificationAWA-LTPer-Class Accuracy69.1LDAM
Generalized Few-Shot ClassificationSUN-LTLong-Tailed Accuracy36.4LDAM
Generalized Few-Shot ClassificationSUN-LTPer-Class Accuracy29.8LDAM
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate11.84LDAM-DRW
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate13.21Class-balanced Resampling
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate13.61Empirical Risk Minimization (ERM, CE)
Long-tail LearningCIFAR-100-LT (ρ=10)Error Rate41.29LDAM-DRW
Long-tail LearningCOCO-MLTAverage mAP40.53LDAM(ResNet-50)
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate57.96LDAM-DRW
Long-tail LearningVOC-MLTAverage mAP70.73LDAM(ResNet-50)
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate22.97LDAM-DRW
Long-tail LearningCUB-LTLong-Tailed Accuracy64.1LDAM
Long-tail LearningCUB-LTPer-Class Accuracy50.1LDAM
Long-tail LearningAWA-LTLong-Tailed Accuracy93.5LDAM
Long-tail LearningAWA-LTPer-Class Accuracy69.1LDAM
Long-tail LearningSUN-LTLong-Tailed Accuracy36.4LDAM
Long-tail LearningSUN-LTPer-Class Accuracy29.8LDAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate11.84LDAM-DRW
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate13.21Class-balanced Resampling
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate13.61Empirical Risk Minimization (ERM, CE)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=10)Error Rate41.29LDAM-DRW
Generalized Few-Shot LearningCOCO-MLTAverage mAP40.53LDAM(ResNet-50)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate57.96LDAM-DRW
Generalized Few-Shot LearningVOC-MLTAverage mAP70.73LDAM(ResNet-50)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate22.97LDAM-DRW
Generalized Few-Shot LearningCUB-LTLong-Tailed Accuracy64.1LDAM
Generalized Few-Shot LearningCUB-LTPer-Class Accuracy50.1LDAM
Generalized Few-Shot LearningAWA-LTLong-Tailed Accuracy93.5LDAM
Generalized Few-Shot LearningAWA-LTPer-Class Accuracy69.1LDAM
Generalized Few-Shot LearningSUN-LTLong-Tailed Accuracy36.4LDAM
Generalized Few-Shot LearningSUN-LTPer-Class Accuracy29.8LDAM

Related Papers

Mitigating Spurious Correlations with Causal Logit Perturbation2025-05-21LIFT+: Lightweight Fine-Tuning for Long-Tail Learning2025-04-17Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition2024-10-28Learning from Neighbors: Category Extrapolation for Long-Tail Learning2024-10-21Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition2024-10-08AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation2024-09-30Representation Norm Amplification for Out-of-Distribution Detection in Long-Tail Learning2024-08-20LTRL: Boosting Long-tail Recognition via Reflective Learning2024-07-17