TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Meta-SGD: Learning to Learn Quickly for Few-Shot Learning

Meta-SGD: Learning to Learn Quickly for Few-Shot Learning

Zhenguo Li, Fengwei Zhou, Fei Chen, Hang Li

2017-07-31Few-Shot LearningMeta-LearningReinforcement Learningreinforcement-learning
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Few-shot learning is challenging for learning algorithms that learn each task in isolation and from scratch. In contrast, meta-learning learns from many related tasks a meta-learner that can learn a new task more accurately and faster with fewer examples, where the choice of meta-learners is crucial. In this paper, we develop Meta-SGD, an SGD-like, easily trainable meta-learner that can initialize and adapt any differentiable learner in just one step, on both supervised learning and reinforcement learning. Compared to the popular meta-learner LSTM, Meta-SGD is conceptually simpler, easier to implement, and can be learned more efficiently. Compared to the latest meta-learner MAML, Meta-SGD has a much higher capacity by learning to learn not just the learner initialization, but also the learner update direction and learning rate, all in a single meta-learning process. Meta-SGD shows highly competitive performance for few-shot learning on regression, classification, and reinforcement learning.

Results

TaskDatasetMetricValueModel
Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy17.56Meta SGD
Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy17.31Matching Nets, (from )
Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy16.7Meta LSTM, (from )
Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy16.49MAML, (from )
Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy28.92Meta SGD
Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy26.06Meta LSTM, (from )
Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy22.69Matching Nets, (from )
Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy19.29MAML, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy17.56Meta SGD
Few-Shot Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy17.31Matching Nets, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy16.7Meta LSTM, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (1-shot)Accuracy16.49MAML, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy28.92Meta SGD
Few-Shot Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy26.06Meta LSTM, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy22.69Matching Nets, (from )
Few-Shot Image ClassificationMini-Imagenet 20-way (5-shot)Accuracy19.29MAML, (from )

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18GLAD: Generalizable Tuning for Vision-Language Models2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17