TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Region Comparison Network for Interpretable Few-shot Image...

Region Comparison Network for Interpretable Few-shot Image Classification

Zhiyu Xue, Lixin Duan, Wen Li, Lin Chen, Jiebo Luo

2020-09-08Few-Shot LearningImage ClassificationMetric LearningFew-Shot Image ClassificationGeneral ClassificationClassification
PaperPDFCode(official)

Abstract

While deep learning has been successfully applied to many real-world computer vision tasks, training robust classifiers usually requires a large amount of well-labeled data. However, the annotation is often expensive and time-consuming. Few-shot image classification has thus been proposed to effectively use only a limited number of labeled examples to train models for new classes. Recent works based on transferable metric learning methods have achieved promising classification performance through learning the similarity between the features of samples from the query and support sets. However, rare of them explicitly considers the model interpretability, which can actually be revealed during the training phase. For that, in this work, we propose a metric learning based method named Region Comparison Network (RCN), which is able to reveal how few-shot learning works as in a neural network as well as to find out specific regions that are related to each other in images coming from the query and support sets. Moreover, we also present a visualization strategy named Region Activation Mapping (RAM) to intuitively explain what our method has learned by visualizing intermediate variables in our network. We also present a new way to generalize the interpretability from the level of tasks to categories, which can also be viewed as a method to find the prototypical parts for supporting the final decision of our RCN. Extensive experiments on four benchmark datasets clearly show the effectiveness of our method over existing baselines.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy69.02RCN - ResNet12
Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy61.61RCN - Conv4-64
Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy75.19RCN - ResNet12
Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy71.63RCN - Conv4-64
Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy57.4RCN - ResNet12
Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy53.57RCN - Conv4-64
Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy82.96RCN - ResNet12
Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy77.63RCN - Conv4-64
Few-Shot Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy69.02RCN - ResNet12
Few-Shot Image ClassificationCIFAR-FS 5-way (1-shot)Accuracy61.61RCN - Conv4-64
Few-Shot Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy75.19RCN - ResNet12
Few-Shot Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy71.63RCN - Conv4-64
Few-Shot Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy57.4RCN - ResNet12
Few-Shot Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy53.57RCN - Conv4-64
Few-Shot Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy82.96RCN - ResNet12
Few-Shot Image ClassificationCIFAR-FS 5-way (5-shot)Accuracy77.63RCN - Conv4-64

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Unsupervised Ground Metric Learning2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16