Kwonjoon Lee, Subhransu Maji, Avinash Ravichandran, Stefano Soatto
Many meta-learning approaches for few-shot learning rely on simple base learners such as nearest-neighbor classifiers. However, even in the few-shot regime, discriminatively trained linear predictors can offer better generalization. We propose to use these predictors as base learners to learn representations for few-shot learning and show they offer better tradeoffs between feature size and performance across a range of few-shot recognition benchmarks. Our objective is to learn feature embeddings that generalize well under a linear classification rule for novel categories. To efficiently solve the objective, we exploit two properties of linear classifiers: implicit differentiation of the optimality conditions of the convex problem and the dual formulation of the optimization problem. This allows us to use high-dimensional embeddings with improved generalization at a modest increase in computational overhead. Our approach, named MetaOptNet, achieves state-of-the-art performance on miniImageNet, tieredImageNet, CIFAR-FS, and FC100 few-shot learning benchmarks. Our code is available at https://github.com/kjunelee/MetaOptNet.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 72.8 | MetaOptNet-SVM-trainval |
| Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 80 | MetaOptNet-SVM-trainval |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 64.09 | MetaOptNet-SVM-trainval |
| Image Classification | FC100 5-way (5-shot) | Accuracy | 62.5 | MetaOptNet-SVM-trainval |
| Image Classification | FC100 5-way (1-shot) | Accuracy | 47.2 | MetaOptNet-SVM-trainval |
| Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 65.81 | MetaOptNet-SVM-trainval |
| Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 81.75 | MetaOptNet-SVM-trainval |
| Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 85 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 72.8 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 80 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 64.09 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | FC100 5-way (5-shot) | Accuracy | 62.5 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | FC100 5-way (1-shot) | Accuracy | 47.2 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 65.81 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 81.75 | MetaOptNet-SVM-trainval |
| Few-Shot Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 85 | MetaOptNet-SVM-trainval |