TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Do Deep Networks Transfer Invariances Across Classes?

Do Deep Networks Transfer Invariances Across Classes?

Allan Zhou, Fahim Tajwar, Alexander Robey, Tom Knowles, George J. Pappas, Hamed Hassani, Chelsea Finn

2022-03-18ICLR 2022 4Image ClassificationLong-tail Learning
PaperPDFCode(official)

Abstract

To generalize well, classifiers must learn to be invariant to nuisance transformations that do not alter an input's class. Many problems have "class-agnostic" nuisance transformations that apply similarly to all classes, such as lighting and background changes for image classification. Neural networks can learn these invariances given sufficient data, but many real-world datasets are heavily class imbalanced and contain only a few examples for most of the classes. We therefore pose the question: how well do neural networks transfer class-agnostic invariances learned from the large classes to the small ones? Through careful experimentation, we observe that invariance to class-agnostic transformations is still heavily dependent on class size, with the networks being much less invariant on smaller classes. This result holds even when using data balancing techniques, and suggests poor invariance transfer across classes. Our results provide one explanation for why classifiers generalize poorly on unbalanced and long-tailed distributions. Based on this analysis, we show how a generative approach for learning the nuisance transformations can help transfer invariances across classes and improve performance on a set of imbalanced image classification benchmarks. Source code for our experiments is available at https://github.com/AllanYangZhou/generative-invariance-transfer.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate55.65CE+DRS+GIT
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate21.24CE+DRS+GIT
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate55.65CE+DRS+GIT
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate21.24CE+DRS+GIT
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate55.65CE+DRS+GIT
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate21.24CE+DRS+GIT
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate55.65CE+DRS+GIT
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate21.24CE+DRS+GIT
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate55.65CE+DRS+GIT
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate21.24CE+DRS+GIT

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13