TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Long-tailed Recognition by Routing Diverse Distribution-Aw...

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts

Xudong Wang, Long Lian, Zhongqi Miao, Ziwei Liu, Stella X. Yu

2020-10-05ICLR 2021 1Image ClassificationLong-tail Learningimbalanced classification
PaperPDFCodeCode(official)

Abstract

Natural data are often long-tail distributed over semantic classes. Existing recognition methods tackle this imbalanced classification by placing more emphasis on the tail data, through class re-balancing/re-weighting or ensembling over different data groups, resulting in increased tail accuracies but reduced head accuracies. We take a dynamic view of the training data and provide a principled model bias and variance analysis as the training data fluctuates: Existing long-tail classifiers invariably increase the model variance and the head-tail model bias gap remains large, due to more and larger confusion with hard negatives for the tail. We propose a new long-tailed classifier called RoutIng Diverse Experts (RIDE). It reduces the model variance with multiple experts, reduces the model bias with a distribution-aware diversity loss, reduces the computational cost with a dynamic expert routing module. RIDE outperforms the state-of-the-art by 5% to 7% on CIFAR100-LT, ImageNet-LT and iNaturalist 2018 benchmarks. It is also a universal framework that is applicable to various backbone networks, long-tailed algorithms, and training mechanisms for consistent performance gains. Our code is available at: https://github.com/frank-xwang/RIDE-LongTailRecognition.

Results

TaskDatasetMetricValueModel
Image ClassificationImageNet-LTTop-1 Accuracy56.4RIDE (ResNeXt-50)
Image ClassificationImageNet-LTTop-1 Accuracy54.9RIDE (ResNet-50)
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate50.9RIDE+distill
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate52RIDE
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy56.4RIDE (ResNeXt-50)
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy54.9RIDE (ResNet-50)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate50.9RIDE+distill
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate52RIDE
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy56.4RIDE (ResNeXt-50)
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy54.9RIDE (ResNet-50)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate50.9RIDE+distill
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate52RIDE
Long-tail LearningImageNet-LTTop-1 Accuracy56.4RIDE (ResNeXt-50)
Long-tail LearningImageNet-LTTop-1 Accuracy54.9RIDE (ResNet-50)
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate50.9RIDE+distill
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate52RIDE
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy56.4RIDE (ResNeXt-50)
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy54.9RIDE (ResNet-50)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate50.9RIDE+distill
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate52RIDE

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13