Jake Snell, Kevin Swersky, Richard S. Zemel
We propose prototypical networks for the problem of few-shot classification, where a classifier must generalize to new classes not seen in the training set, given only a small number of examples of each new class. Prototypical networks learn a metric space in which classification can be performed by computing distances to prototype representations of each class. Compared to recent approaches for few-shot learning, they reflect a simpler inductive bias that is beneficial in this limited-data regime, and achieve excellent results. We provide an analysis showing that some simple design decisions can yield substantial improvements over recent approaches involving complicated architectural choices and meta-learning. We further extend prototypical networks to zero-shot learning and achieve state-of-the-art results on the CU-Birds dataset.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| 2D Pose Estimation | MP100 | Mean PCK@0.2 - 1shot | 44.78 | ProtoNet |
| Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 69.57 | Prototypical Net |
| Image Classification | Meta-Dataset | Accuracy | 60.573 | Prototypical Networks |
| Image Classification | Tiered ImageNet 10-way (1-shot) | Accuracy | 38.6 | Prototypical Networks (Higher Way) |
| Image Classification | Tiered ImageNet 10-way (1-shot) | Accuracy | 37.3 | Prototypical Networks |
| Image Classification | Dirichlet Mini-Imagenet (5-way, 5-shot) | 1:1 Accuracy | 74.2 | ProtoNet |
| Image Classification | Mini-Imagenet 10-way (5-shot) | Accuracy | 50.1 | Prototypical Networks (Higher Way) |
| Image Classification | Mini-Imagenet 10-way (5-shot) | Accuracy | 49.3 | Prototypical Networks |
| Image Classification | OMNIGLOT - 1-Shot, 5-way | Accuracy | 98.8 | Prototypical Networks |
| Image Classification | OMNIGLOT - 5-Shot, 5-way | Accuracy | 99.7 | Prototypical Networks |
| Image Classification | Mini-ImageNet-CUB 5-way (1-shot) | Accuracy | 45.31 | ProtoNet (Snell et al., 2017) |
| Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 68.2 | Prototypical Networks |
| Image Classification | Dirichlet Mini-Imagenet (5-way, 1-shot) | 1:1 Accuracy | 53.6 | ProtoNet |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 49.42 | Prototypical Networks |
| Image Classification | Stanford Dogs 5-way (5-shot) | Accuracy | 48.19 | Prototypical Nets++ |
| Image Classification | Mini-Imagenet 10-way (1-shot) | Accuracy | 34.6 | Prototypical Networks (Higher Way) |
| Image Classification | Mini-Imagenet 10-way (1-shot) | Accuracy | 32.9 | Prototypical Networks |
| Image Classification | Meta-Dataset Rank | Mean Rank | 8.5 | Prototypical Networks |
| Image Classification | Stanford Cars 5-way (5-shot) | Accuracy | 52.93 | Prototypical Nets++ |
| Image Classification | Stanford Cars 5-way (1-shot) | Accuracy | 40.9 | Prototypical Nets++ |
| Image Classification | CUB 200 50-way (0-shot) | Accuracy | 54.6 | Prototypical Networks |
| Image Classification | Mini-Imagenet 5-way (10-shot) | Accuracy | 74.3 | Prototypical Networks |
| Image Classification | Tiered ImageNet 10-way (5-shot) | Accuracy | 58.3 | Prototypical Networks (Higher Way) |
| Image Classification | Tiered ImageNet 10-way (5-shot) | Accuracy | 57.8 | Prototypical Networks |
| Few-Shot Image Classification | Meta-Dataset | Accuracy | 60.573 | Prototypical Networks |
| Few-Shot Image Classification | Tiered ImageNet 10-way (1-shot) | Accuracy | 38.6 | Prototypical Networks (Higher Way) |
| Few-Shot Image Classification | Tiered ImageNet 10-way (1-shot) | Accuracy | 37.3 | Prototypical Networks |
| Few-Shot Image Classification | Dirichlet Mini-Imagenet (5-way, 5-shot) | 1:1 Accuracy | 74.2 | ProtoNet |
| Few-Shot Image Classification | Mini-Imagenet 10-way (5-shot) | Accuracy | 50.1 | Prototypical Networks (Higher Way) |
| Few-Shot Image Classification | Mini-Imagenet 10-way (5-shot) | Accuracy | 49.3 | Prototypical Networks |
| Few-Shot Image Classification | OMNIGLOT - 1-Shot, 5-way | Accuracy | 98.8 | Prototypical Networks |
| Few-Shot Image Classification | OMNIGLOT - 5-Shot, 5-way | Accuracy | 99.7 | Prototypical Networks |
| Few-Shot Image Classification | Mini-ImageNet-CUB 5-way (1-shot) | Accuracy | 45.31 | ProtoNet (Snell et al., 2017) |
| Few-Shot Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 68.2 | Prototypical Networks |
| Few-Shot Image Classification | Dirichlet Mini-Imagenet (5-way, 1-shot) | 1:1 Accuracy | 53.6 | ProtoNet |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 49.42 | Prototypical Networks |
| Few-Shot Image Classification | Stanford Dogs 5-way (5-shot) | Accuracy | 48.19 | Prototypical Nets++ |
| Few-Shot Image Classification | Mini-Imagenet 10-way (1-shot) | Accuracy | 34.6 | Prototypical Networks (Higher Way) |
| Few-Shot Image Classification | Mini-Imagenet 10-way (1-shot) | Accuracy | 32.9 | Prototypical Networks |
| Few-Shot Image Classification | Meta-Dataset Rank | Mean Rank | 8.5 | Prototypical Networks |
| Few-Shot Image Classification | Stanford Cars 5-way (5-shot) | Accuracy | 52.93 | Prototypical Nets++ |
| Few-Shot Image Classification | Stanford Cars 5-way (1-shot) | Accuracy | 40.9 | Prototypical Nets++ |
| Few-Shot Image Classification | CUB 200 50-way (0-shot) | Accuracy | 54.6 | Prototypical Networks |
| Few-Shot Image Classification | Mini-Imagenet 5-way (10-shot) | Accuracy | 74.3 | Prototypical Networks |
| Few-Shot Image Classification | Tiered ImageNet 10-way (5-shot) | Accuracy | 58.3 | Prototypical Networks (Higher Way) |
| Few-Shot Image Classification | Tiered ImageNet 10-way (5-shot) | Accuracy | 57.8 | Prototypical Networks |
| 2D Classification | MP100 | Mean PCK@0.2 - 1shot | 44.78 | ProtoNet |