TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Escaping Saddle Points for Effective Generalization on Cla...

Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

Harsh Rangwani, Sumukh K Aithal, Mayank Mishra, R. Venkatesh Babu

2022-12-28Long-tail Learning
PaperPDFCode(official)

Abstract

Real-world datasets exhibit imbalances of varying types and degrees. Several techniques based on re-weighting and margin adjustment of loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin-based techniques. Specifically, we examine the spectral density of Hessian of class-wise loss, through which we observe that the network weights converge to a saddle point in the loss landscapes of minority classes. Following this observation, we also find that optimization methods designed to escape from saddle points can be effectively used to improve generalization on minority classes. We further theoretically and empirically demonstrate that Sharpness-Aware Minimization (SAM), a recent technique that encourages convergence to a flat minima, can be effectively used to escape saddle points for minority classes. Using SAM results in a 6.2\% increase in accuracy on the minority classes over the state-of-the-art Vector Scaling Loss, leading to an overall average increase of 4\% across imbalanced datasets. The code is available at: https://github.com/val-iisc/Saddle-LongTail.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-100-LT (ρ=200)Error Rate52PaCo + SAM
Image ClassificationCIFAR-10-LT (ρ=10)Error Rate10.6LDAM + DRW + SAM
Image ClassificationiNaturalist 2018Top-1 Accuracy70.1LDAM + DRW + SAM
Image ClassificationCIFAR-100-LT (ρ=50)Error Rate34.72GLMC + SAM
Image ClassificationImageNet-LTTop-1 Accuracy53.1LDAM + DRW + SAM
Image ClassificationCIFAR-10-LT (ρ=50)Error Rate8.44GLMC + SAM
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate40.99GLMC + SAM
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate47PaCo + SAM
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate53.4VS + SAM
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate10.82GLMC + SAM
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate17.6VS + SAM
Image ClassificationCIFAR-10-LT (ρ=200)Error Rate21.9LDAM + DRW + SAM
Few-Shot Image ClassificationCIFAR-100-LT (ρ=200)Error Rate52PaCo + SAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=10)Error Rate10.6LDAM + DRW + SAM
Few-Shot Image ClassificationiNaturalist 2018Top-1 Accuracy70.1LDAM + DRW + SAM
Few-Shot Image ClassificationCIFAR-100-LT (ρ=50)Error Rate34.72GLMC + SAM
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy53.1LDAM + DRW + SAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=50)Error Rate8.44GLMC + SAM
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate40.99GLMC + SAM
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate47PaCo + SAM
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate53.4VS + SAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate10.82GLMC + SAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate17.6VS + SAM
Few-Shot Image ClassificationCIFAR-10-LT (ρ=200)Error Rate21.9LDAM + DRW + SAM
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=200)Error Rate52PaCo + SAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=10)Error Rate10.6LDAM + DRW + SAM
Generalized Few-Shot ClassificationiNaturalist 2018Top-1 Accuracy70.1LDAM + DRW + SAM
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=50)Error Rate34.72GLMC + SAM
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy53.1LDAM + DRW + SAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=50)Error Rate8.44GLMC + SAM
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate40.99GLMC + SAM
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate47PaCo + SAM
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate53.4VS + SAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate10.82GLMC + SAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate17.6VS + SAM
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=200)Error Rate21.9LDAM + DRW + SAM
Long-tail LearningCIFAR-100-LT (ρ=200)Error Rate52PaCo + SAM
Long-tail LearningCIFAR-10-LT (ρ=10)Error Rate10.6LDAM + DRW + SAM
Long-tail LearningiNaturalist 2018Top-1 Accuracy70.1LDAM + DRW + SAM
Long-tail LearningCIFAR-100-LT (ρ=50)Error Rate34.72GLMC + SAM
Long-tail LearningImageNet-LTTop-1 Accuracy53.1LDAM + DRW + SAM
Long-tail LearningCIFAR-10-LT (ρ=50)Error Rate8.44GLMC + SAM
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate40.99GLMC + SAM
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate47PaCo + SAM
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate53.4VS + SAM
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate10.82GLMC + SAM
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate17.6VS + SAM
Long-tail LearningCIFAR-10-LT (ρ=200)Error Rate21.9LDAM + DRW + SAM
Generalized Few-Shot LearningCIFAR-100-LT (ρ=200)Error Rate52PaCo + SAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=10)Error Rate10.6LDAM + DRW + SAM
Generalized Few-Shot LearningiNaturalist 2018Top-1 Accuracy70.1LDAM + DRW + SAM
Generalized Few-Shot LearningCIFAR-100-LT (ρ=50)Error Rate34.72GLMC + SAM
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy53.1LDAM + DRW + SAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=50)Error Rate8.44GLMC + SAM
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate40.99GLMC + SAM
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate47PaCo + SAM
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate53.4VS + SAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate10.82GLMC + SAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate17.6VS + SAM
Generalized Few-Shot LearningCIFAR-10-LT (ρ=200)Error Rate21.9LDAM + DRW + SAM

Related Papers

Mitigating Spurious Correlations with Causal Logit Perturbation2025-05-21LIFT+: Lightweight Fine-Tuning for Long-Tail Learning2025-04-17Improving Visual Prompt Tuning by Gaussian Neighborhood Minimization for Long-Tailed Visual Recognition2024-10-28Learning from Neighbors: Category Extrapolation for Long-Tail Learning2024-10-21Continuous Contrastive Learning for Long-Tailed Semi-Supervised Recognition2024-10-08AUCSeg: AUC-oriented Pixel-level Long-tail Semantic Segmentation2024-09-30Representation Norm Amplification for Out-of-Distribution Detection in Long-Tail Learning2024-08-20LTRL: Boosting Long-tail Recognition via Reflective Learning2024-07-17