Dahyun Kang, Heeseung Kwon, Juhong Min, Minsu Cho
We propose to address the problem of few-shot classification by meta-learning "what to observe" and "where to attend" in a relational perspective. Our method leverages relational patterns within and between images via self-correlational representation (SCR) and cross-correlational attention (CCA). Within each image, the SCR module transforms a base feature map into a self-correlation tensor and learns to extract structural patterns from the tensor. Between the images, the CCA module computes cross-correlation between two image representations and learns to produce co-attention between them. Our Relational Embedding Network (RENet) combines the two relational modules to learn relational embedding in an end-to-end manner. In experimental evaluation, it achieves consistent improvements over state-of-the-art methods on four widely used few-shot classification benchmarks of miniImageNet, tieredImageNet, CUB-200-2011, and CIFAR-FS.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Classification | CUB 200 5-way 5-shot | Accuracy | 91.11 | RENet |
| Image Classification | CUB 200 5-way 1-shot | Accuracy | 79.49 | RENet |
| Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 74.51 | RENet |
| Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 82.58 | RENet |
| Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 67.6 | RENet |
| Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 71.61 | RENet |
| Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 85.28 | RENet |
| Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 86.6 | RENet |
| Few-Shot Image Classification | CUB 200 5-way 5-shot | Accuracy | 91.11 | RENet |
| Few-Shot Image Classification | CUB 200 5-way 1-shot | Accuracy | 79.49 | RENet |
| Few-Shot Image Classification | CIFAR-FS 5-way (1-shot) | Accuracy | 74.51 | RENet |
| Few-Shot Image Classification | Mini-Imagenet 5-way (5-shot) | Accuracy | 82.58 | RENet |
| Few-Shot Image Classification | Mini-Imagenet 5-way (1-shot) | Accuracy | 67.6 | RENet |
| Few-Shot Image Classification | Tiered ImageNet 5-way (1-shot) | Accuracy | 71.61 | RENet |
| Few-Shot Image Classification | Tiered ImageNet 5-way (5-shot) | Accuracy | 85.28 | RENet |
| Few-Shot Image Classification | CIFAR-FS 5-way (5-shot) | Accuracy | 86.6 | RENet |