TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/On First-Order Meta-Learning Algorithms

On First-Order Meta-Learning Algorithms

Alex Nichol, Joshua Achiam, John Schulman

2018-03-08Few-Shot LearningMeta-LearningFew-Shot Image Classification
PaperPDFCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCode

Abstract

This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution. We analyze a family of algorithms for learning a parameter initialization that can be fine-tuned quickly on a new task, using only first-order derivatives for the meta-learning updates. This family includes and generalizes first-order MAML, an approximation to MAML obtained by ignoring second-order derivatives. It also includes Reptile, a new algorithm that we introduce here, which works by repeatedly sampling a task, training on it, and moving the initialization towards the trained weights on that task. We expand on the results from Finn et al. showing that first-order meta-learning algorithms perform well on some well-established benchmarks for few-shot classification, and we provide theoretical analysis aimed at understanding why these algorithms work.

Results

TaskDatasetMetricValueModel
Image ClassificationTiered ImageNet 5-way (5-shot)Accuracy71.03Reptile + BN
Image ClassificationTiered ImageNet 5-way (5-shot)Accuracy66.47Reptile
Image ClassificationTiered ImageNet 10-way (1-shot)Accuracy35.3Reptile+BN
Image ClassificationTiered ImageNet 10-way (1-shot)Accuracy33.7Reptile
Image ClassificationMini-Imagenet 10-way (5-shot)Accuracy47.6Reptile+BN
Image ClassificationMini-Imagenet 10-way (5-shot)Accuracy44.7Reptile
Image ClassificationOMNIGLOT - 1-Shot, 5-wayAccuracy97.68Reptile + Transduction
Image ClassificationOMNIGLOT - 5-Shot, 5-wayAccuracy99.48Reptile + Transduction
Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy65.99Reptile + Transduction
Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy49.97Reptile + Transduction
Image ClassificationMini-Imagenet 10-way (1-shot)Accuracy32Reptile+BN
Image ClassificationMini-Imagenet 10-way (1-shot)Accuracy31.1Reptile
Image ClassificationTiered ImageNet 10-way (5-shot)Accuracy52Reptile+BN
Image ClassificationTiered ImageNet 10-way (5-shot)Accuracy48Reptile
Few-Shot Image ClassificationTiered ImageNet 10-way (1-shot)Accuracy35.3Reptile+BN
Few-Shot Image ClassificationTiered ImageNet 10-way (1-shot)Accuracy33.7Reptile
Few-Shot Image ClassificationMini-Imagenet 10-way (5-shot)Accuracy47.6Reptile+BN
Few-Shot Image ClassificationMini-Imagenet 10-way (5-shot)Accuracy44.7Reptile
Few-Shot Image ClassificationOMNIGLOT - 1-Shot, 5-wayAccuracy97.68Reptile + Transduction
Few-Shot Image ClassificationOMNIGLOT - 5-Shot, 5-wayAccuracy99.48Reptile + Transduction
Few-Shot Image ClassificationMini-Imagenet 5-way (5-shot)Accuracy65.99Reptile + Transduction
Few-Shot Image ClassificationMini-Imagenet 5-way (1-shot)Accuracy49.97Reptile + Transduction
Few-Shot Image ClassificationMini-Imagenet 10-way (1-shot)Accuracy32Reptile+BN
Few-Shot Image ClassificationMini-Imagenet 10-way (1-shot)Accuracy31.1Reptile
Few-Shot Image ClassificationTiered ImageNet 10-way (5-shot)Accuracy52Reptile+BN
Few-Shot Image ClassificationTiered ImageNet 10-way (5-shot)Accuracy48Reptile

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16Mixture of Experts in Large Language Models2025-07-15Iceberg: Enhancing HLS Modeling with Synthetic Data2025-07-14Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks2025-07-13ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation2025-07-12