Eli Schwartz, Leonid Karlinsky, Joseph Shtok, Sivan Harary, Mattias Marder, Rogerio Feris, Abhishek Kumar, Raja Giryes, Alex M. Bronstein
Learning to classify new categories based on just one or a few examples is a long-standing challenge in modern computer vision. In this work, we proposes a simple yet effective method for few-shot (and one-shot) object recognition. Our approach is based on a modified auto-encoder, denoted Delta-encoder, that learns to synthesize new samples for an unseen category just by seeing few examples from it. The synthesized samples are then used to train a classifier. The proposed approach learns to both extract transferable intra-class deformations, or "deltas", between same-class pairs of training examples, and to apply those deltas to the few provided examples of a novel class (unseen during training) in order to efficiently synthesize samples from that new class. The proposed method improves over the state-of-the-art in one-shot object-recognition and compares favorably in the few-shot case. Upon acceptance code will be made available.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | CUB 200 5-way 1-shot | Accuracy | 69.8 | Delta-encoder |
| Image Classification | CIFAR100 5-way (1-shot) | Accuracy | 66.7 | Delta-encoder |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 59.9 | Delta-encoder |
| Image Classification | Caltech-256 5-way (1-shot) | Accuracy | 73.2 | Delta-encoder |
| Few-Shot Image Classification | CUB 200 5-way 1-shot | Accuracy | 69.8 | Delta-encoder |
| Few-Shot Image Classification | CIFAR100 5-way (1-shot) | Accuracy | 66.7 | Delta-encoder |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 59.9 | Delta-encoder |
| Few-Shot Image Classification | Caltech-256 5-way (1-shot) | Accuracy | 73.2 | Delta-encoder |