TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Nested Collaborative Learning for Long-Tailed Visual Recog...

Nested Collaborative Learning for Long-Tailed Visual Recognition

Jun Li, Zichang Tan, Jun Wan, Zhen Lei, Guodong Guo

2022-03-29CVPR 2022 1Image ClassificationLong-tail Learning
PaperPDFCode(official)

Abstract

The networks trained on the long-tailed dataset vary remarkably, despite the same training settings, which shows the great uncertainty in long-tailed learning. To alleviate the uncertainty, we propose a Nested Collaborative Learning (NCL), which tackles the problem by collaboratively learning multiple experts together. NCL consists of two core components, namely Nested Individual Learning (NIL) and Nested Balanced Online Distillation (NBOD), which focus on the individual supervised learning for each single expert and the knowledge transferring among multiple experts, respectively. To learn representations more thoroughly, both NIL and NBOD are formulated in a nested way, in which the learning is conducted on not just all categories from a full perspective but some hard categories from a partial perspective. Regarding the learning in the partial perspective, we specifically select the negative categories with high predicted scores as the hard categories by using a proposed Hard Category Mining (HCM). In the NCL, the learning from two perspectives is nested, highly related and complementary, and helps the network to capture not only global and robust features but also meticulous distinguishing ability. Moreover, self-supervision is further utilized for feature enhancement. Extensive experiments manifest the superiority of our method with outperforming the state-of-the-art whether by using a single model or an ensemble.

Results

TaskDatasetMetricValueModel
Image ClassificationPlaces-LTTop-1 Accuracy41.5NCL(ResNet-152)
Image ClassificationCIFAR-100-LT (ρ=50)Error Rate43.2NCL(ResNet32)
Image ClassificationImageNet-LTTop-1 Accuracy58.4NCL(ResNeXt-50)
Image ClassificationImageNet-LTTop-1 Accuracy57.4NCL(ResNet-50)
Image ClassificationCIFAR-10-LT (ρ=50)Error Rate13.2NCL(ResNet32)
Image ClassificationCIFAR-100-LT (ρ=100)Error Rate46.7NCL(ResNet32)
Image ClassificationCIFAR-10-LT (ρ=100)Error Rate15.3NCL(ResNet32)
Few-Shot Image ClassificationPlaces-LTTop-1 Accuracy41.5NCL(ResNet-152)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=50)Error Rate43.2NCL(ResNet32)
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy58.4NCL(ResNeXt-50)
Few-Shot Image ClassificationImageNet-LTTop-1 Accuracy57.4NCL(ResNet-50)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=50)Error Rate13.2NCL(ResNet32)
Few-Shot Image ClassificationCIFAR-100-LT (ρ=100)Error Rate46.7NCL(ResNet32)
Few-Shot Image ClassificationCIFAR-10-LT (ρ=100)Error Rate15.3NCL(ResNet32)
Generalized Few-Shot ClassificationPlaces-LTTop-1 Accuracy41.5NCL(ResNet-152)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=50)Error Rate43.2NCL(ResNet32)
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy58.4NCL(ResNeXt-50)
Generalized Few-Shot ClassificationImageNet-LTTop-1 Accuracy57.4NCL(ResNet-50)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=50)Error Rate13.2NCL(ResNet32)
Generalized Few-Shot ClassificationCIFAR-100-LT (ρ=100)Error Rate46.7NCL(ResNet32)
Generalized Few-Shot ClassificationCIFAR-10-LT (ρ=100)Error Rate15.3NCL(ResNet32)
Long-tail LearningPlaces-LTTop-1 Accuracy41.5NCL(ResNet-152)
Long-tail LearningCIFAR-100-LT (ρ=50)Error Rate43.2NCL(ResNet32)
Long-tail LearningImageNet-LTTop-1 Accuracy58.4NCL(ResNeXt-50)
Long-tail LearningImageNet-LTTop-1 Accuracy57.4NCL(ResNet-50)
Long-tail LearningCIFAR-10-LT (ρ=50)Error Rate13.2NCL(ResNet32)
Long-tail LearningCIFAR-100-LT (ρ=100)Error Rate46.7NCL(ResNet32)
Long-tail LearningCIFAR-10-LT (ρ=100)Error Rate15.3NCL(ResNet32)
Generalized Few-Shot LearningPlaces-LTTop-1 Accuracy41.5NCL(ResNet-152)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=50)Error Rate43.2NCL(ResNet32)
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy58.4NCL(ResNeXt-50)
Generalized Few-Shot LearningImageNet-LTTop-1 Accuracy57.4NCL(ResNet-50)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=50)Error Rate13.2NCL(ResNet32)
Generalized Few-Shot LearningCIFAR-100-LT (ρ=100)Error Rate46.7NCL(ResNet32)
Generalized Few-Shot LearningCIFAR-10-LT (ρ=100)Error Rate15.3NCL(ResNet32)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13