Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, Daan Wierstra
Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for learning new concepts rapidly from little data. In this work, we employ ideas from metric learning based on deep neural features and from recent advances that augment neural networks with external memories. Our framework learns a network that maps a small labelled support set and an unlabelled example to its label, obviating the need for fine-tuning to adapt to new class types. We then define one-shot learning problems on vision (using Omniglot, ImageNet) and language tasks. Our algorithm improves one-shot accuracy on ImageNet from 87.6% to 93.2% and from 88.0% to 93.8% on Omniglot compared to competing approaches. We also demonstrate the usefulness of the same model on language modeling by introducing a one-shot task on the Penn Treebank.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | Meta-Dataset | Accuracy | 56.247 | Matching Networks |
| Image Classification | OMNIGLOT - 1-Shot, 5-way | Accuracy | 98.1 | Matching Nets |
| Image Classification | OMNIGLOT - 5-Shot, 5-way | Accuracy | 98.9 | Matching Nets |
| Image Classification | Mini-ImageNet-CUB 5-way (1-shot) | Accuracy | 45.59 | MatchingNet (Vinyals et al., 2016) |
| Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 60 | Matching Nets (Cosine Matching Fn) |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 46.6 | Matching Nets (Cosine Matching Fn) |
| Image Classification | Stanford Dogs 5-way (5-shot) | Accuracy | 47.5 | Matching Nets FCE++ |
| Image Classification | Meta-Dataset Rank | Mean Rank | 10.5 | Matching Networks |
| Image Classification | Stanford Cars 5-way (5-shot) | Accuracy | 44.7 | Matching Nets FCE++ |
| Image Classification | Stanford Cars 5-way (1-shot) | Accuracy | 34.8 | Matching Nets FCE++ |
| Few-Shot Image Classification | Meta-Dataset | Accuracy | 56.247 | Matching Networks |
| Few-Shot Image Classification | OMNIGLOT - 1-Shot, 5-way | Accuracy | 98.1 | Matching Nets |
| Few-Shot Image Classification | OMNIGLOT - 5-Shot, 5-way | Accuracy | 98.9 | Matching Nets |
| Few-Shot Image Classification | Mini-ImageNet-CUB 5-way (1-shot) | Accuracy | 45.59 | MatchingNet (Vinyals et al., 2016) |
| Few-Shot Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 60 | Matching Nets (Cosine Matching Fn) |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 46.6 | Matching Nets (Cosine Matching Fn) |
| Few-Shot Image Classification | Stanford Dogs 5-way (5-shot) | Accuracy | 47.5 | Matching Nets FCE++ |
| Few-Shot Image Classification | Meta-Dataset Rank | Mean Rank | 10.5 | Matching Networks |
| Few-Shot Image Classification | Stanford Cars 5-way (5-shot) | Accuracy | 44.7 | Matching Nets FCE++ |
| Few-Shot Image Classification | Stanford Cars 5-way (1-shot) | Accuracy | 34.8 | Matching Nets FCE++ |