Puneet Mangla, Mayank Singh, Abhishek Sinha, Nupur Kumari, Vineeth N. Balasubramanian, Balaji Krishnamurthy
Few-shot learning algorithms aim to learn model parameters capable of adapting to unseen classes with the help of only a few labeled examples. A recent regularization technique - Manifold Mixup focuses on learning a general-purpose representation, robust to small changes in the data distribution. Since the goal of few-shot learning is closely linked to robust representation learning, we study Manifold Mixup in this problem setting. Self-supervised learning is another technique that learns semantically meaningful features, using only the inherent structure of the data. This work investigates the role of learning relevant feature manifold for few-shot tasks using self-supervision and regularization techniques. We observe that regularizing the feature manifold, enriched via self-supervised techniques, with Manifold Mixup significantly improves few-shot learning performance. We show that our proposed method S2M2 beats the current state-of-the-art accuracy on standard few-shot learning datasets like CIFAR-FS, CUB, mini-ImageNet and tiered-ImageNet by 3-8 %. Through extensive experimentation, we show that the features learned using our approach generalize to complex few-shot evaluation tasks, cross-domain scenarios and are robust against slight changes to data distribution.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | CUB 200 5-way 5-shot | Accuracy | 90.85 | S2M2R |
| Image Classification | CUB 200 5-way 1-shot | Accuracy | 80.68 | S2M2R |
| Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 74.81 | S2M2R |
| Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 83.18 | S2M2R |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 64.93 | S2M2R |
| Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 73.71 | S2M2R |
| Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 88.59 | S2M2R |
| Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 87.47 | S2M2R |
| Few-Shot Image Classification | CUB 200 5-way 5-shot | Accuracy | 90.85 | S2M2R |
| Few-Shot Image Classification | CUB 200 5-way 1-shot | Accuracy | 80.68 | S2M2R |
| Few-Shot Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 74.81 | S2M2R |
| Few-Shot Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 83.18 | S2M2R |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 64.93 | S2M2R |
| Few-Shot Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 73.71 | S2M2R |
| Few-Shot Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 88.59 | S2M2R |
| Few-Shot Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 87.47 | S2M2R |