TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Exploring Complementary Strengths of Invariant and Equivar...

Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning

Mamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah

2021-03-01CVPR 2021 1Few-Shot LearningMeta-LearningMetric LearningFew-Shot Image ClassificationKnowledge Distillation
PaperPDFCode(official)

Abstract

In many real-world problems, collecting a large number of labeled samples is infeasible. Few-shot learning (FSL) is the dominant approach to address this issue, where the objective is to quickly adapt to novel categories in presence of a limited number of samples. FSL tasks have been predominantly solved by leveraging the ideas from gradient-based meta-learning and metric learning approaches. However, recent works have demonstrated the significance of powerful feature representations with a simple embedding network that can outperform existing sophisticated FSL algorithms. In this work, we build on this insight and propose a novel training mechanism that simultaneously enforces equivariance and invariance to a general set of geometric transformations. Equivariance or invariance has been employed standalone in the previous works; however, to the best of our knowledge, they have not been used jointly. Simultaneous optimization for both of these contrasting objectives allows the model to jointly learn features that are not only independent of the input transformation but also the features that encode the structure of geometric transformations. These complementary sets of features help generalize well to novel classes with only a few data samples. We achieve additional improvements by incorporating a novel self-supervised distillation objective. Our extensive experimentation shows that even without knowledge distillation our proposed method can outperform current state-of-the-art FSL methods on five popular benchmark datasets.

Results

TaskDatasetMetricValueModel
Image ClassificationMeta-DatasetAccuracy68.89Invariance-Equivariance
Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy77.87Invariance-Equivariance
Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy84.78Invariance-Equivariance
Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy67.28Invariance-Equivariance
Image ClassificationFC100 5-way (5-shot)Accuracy65.3Invariance-Equivariance
Image ClassificationFC100 5-way (1-shot)Accuracy47.76Invariance-Equivariance
Image ClassificationTiered ImageNet 5-way (1-shot)Accuracy72.21Invariance-Equivariance
Image ClassificationTiered ImageNet 5-way (5-shot)Accuracy87.08Invariance-Equivariance
Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy89.74Invariance-Equivariance
Few-Shot Image ClassificationMeta-DatasetAccuracy68.89Invariance-Equivariance
Few-Shot Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy77.87Invariance-Equivariance
Few-Shot Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy84.78Invariance-Equivariance
Few-Shot Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy67.28Invariance-Equivariance
Few-Shot Image ClassificationFC100 5-way (5-shot)Accuracy65.3Invariance-Equivariance
Few-Shot Image ClassificationFC100 5-way (1-shot)Accuracy47.76Invariance-Equivariance
Few-Shot Image ClassificationTiered ImageNet 5-way (1-shot)Accuracy72.21Invariance-Equivariance
Few-Shot Image ClassificationTiered ImageNet 5-way (5-shot)Accuracy87.08Invariance-Equivariance
Few-Shot Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy89.74Invariance-Equivariance

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Unsupervised Ground Metric Learning2025-07-17Uncertainty-Aware Cross-Modal Knowledge Distillation with Prototype Learning for Multimodal Brain-Computer Interfaces2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16