TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Matching Networks for One Shot Learning

Matching Networks for One Shot Learning

Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, Daan Wierstra

2016-06-13NeurIPS 2016 12Few-Shot LearningMetric LearningFew-Shot Image ClassificationOne-Shot LearningLanguage Modelling
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for learning new concepts rapidly from little data. In this work, we employ ideas from metric learning based on deep neural features and from recent advances that augment neural networks with external memories. Our framework learns a network that maps a small labelled support set and an unlabelled example to its label, obviating the need for fine-tuning to adapt to new class types. We then define one-shot learning problems on vision (using Omniglot, ImageNet) and language tasks. Our algorithm improves one-shot accuracy on ImageNet from 87.6% to 93.2% and from 88.0% to 93.8% on Omniglot compared to competing approaches. We also demonstrate the usefulness of the same model on language modeling by introducing a one-shot task on the Penn Treebank.

Results

TaskDatasetMetricValueModel
Image ClassificationMeta-DatasetAccuracy56.247Matching Networks
Image ClassificationOMNIGLOT - 1-Shot, 5-wayAccuracy98.1Matching Nets
Image ClassificationOMNIGLOT - 5-Shot, 5-wayAccuracy98.9Matching Nets
Image ClassificationMini-ImageNet-CUB 5-way (1-shot)Accuracy45.59MatchingNet (Vinyals et al., 2016)
Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy60Matching Nets (Cosine Matching Fn)
Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy46.6Matching Nets (Cosine Matching Fn)
Image ClassificationStanford Dogs 5-way (5-shot)Accuracy47.5Matching Nets FCE++
Image ClassificationMeta-Dataset RankMean Rank10.5Matching Networks
Image ClassificationStanford Cars 5-way (5-shot)Accuracy44.7Matching Nets FCE++
Image ClassificationStanford Cars 5-way (1-shot)Accuracy34.8Matching Nets FCE++
Few-Shot Image ClassificationMeta-DatasetAccuracy56.247Matching Networks
Few-Shot Image ClassificationOMNIGLOT - 1-Shot, 5-wayAccuracy98.1Matching Nets
Few-Shot Image ClassificationOMNIGLOT - 5-Shot, 5-wayAccuracy98.9Matching Nets
Few-Shot Image ClassificationMini-ImageNet-CUB 5-way (1-shot)Accuracy45.59MatchingNet (Vinyals et al., 2016)
Few-Shot Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy60Matching Nets (Cosine Matching Fn)
Few-Shot Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy46.6Matching Nets (Cosine Matching Fn)
Few-Shot Image ClassificationStanford Dogs 5-way (5-shot)Accuracy47.5Matching Nets FCE++
Few-Shot Image ClassificationMeta-Dataset RankMean Rank10.5Matching Networks
Few-Shot Image ClassificationStanford Cars 5-way (5-shot)Accuracy44.7Matching Nets FCE++
Few-Shot Image ClassificationStanford Cars 5-way (1-shot)Accuracy34.8Matching Nets FCE++

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Unsupervised Ground Metric Learning2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16