On First-Order Meta-Learning Algorithms

Alex Nichol, Joshua Achiam, John Schulman

2018-03-08Few-Shot Learning Meta-Learning Few-Shot Image Classification

Paper PDF Code Code Code Code Code Code Code(official)Code Code Code Code Code Code

Abstract

This paper considers meta-learning problems, where there is a distribution of tasks, and we would like to obtain an agent that performs well (i.e., learns quickly) when presented with a previously unseen task sampled from this distribution. We analyze a family of algorithms for learning a parameter initialization that can be fine-tuned quickly on a new task, using only first-order derivatives for the meta-learning updates. This family includes and generalizes first-order MAML, an approximation to MAML obtained by ignoring second-order derivatives. It also includes Reptile, a new algorithm that we introduce here, which works by repeatedly sampling a task, training on it, and moving the initialization towards the trained weights on that task. We expand on the results from Finn et al. showing that first-order meta-learning algorithms perform well on some well-established benchmarks for few-shot classification, and we provide theoretical analysis aimed at understanding why these algorithms work.

Results

Task	Dataset	Metric	Value	Model
Image Classification	Tiered ImageNet 5-way (5-shot)	Accuracy	71.03	Reptile + BN
Image Classification	Tiered ImageNet 5-way (5-shot)	Accuracy	66.47	Reptile
Image Classification	Tiered ImageNet 10-way (1-shot)	Accuracy	35.3	Reptile+BN
Image Classification	Tiered ImageNet 10-way (1-shot)	Accuracy	33.7	Reptile
Image Classification	Mini-Imagenet 10-way (5-shot)	Accuracy	47.6	Reptile+BN
Image Classification	Mini-Imagenet 10-way (5-shot)	Accuracy	44.7	Reptile
Image Classification	OMNIGLOT - 1-Shot, 5-way	Accuracy	97.68	Reptile + Transduction
Image Classification	OMNIGLOT - 5-Shot, 5-way	Accuracy	99.48	Reptile + Transduction
Image Classification	Mini-Imagenet 5-way (5-shot)	Accuracy	65.99	Reptile + Transduction
Image Classification	Mini-Imagenet 5-way (1-shot)	Accuracy	49.97	Reptile + Transduction
Image Classification	Mini-Imagenet 10-way (1-shot)	Accuracy	32	Reptile+BN
Image Classification	Mini-Imagenet 10-way (1-shot)	Accuracy	31.1	Reptile
Image Classification	Tiered ImageNet 10-way (5-shot)	Accuracy	52	Reptile+BN
Image Classification	Tiered ImageNet 10-way (5-shot)	Accuracy	48	Reptile
Few-Shot Image Classification	Tiered ImageNet 10-way (1-shot)	Accuracy	35.3	Reptile+BN
Few-Shot Image Classification	Tiered ImageNet 10-way (1-shot)	Accuracy	33.7	Reptile
Few-Shot Image Classification	Mini-Imagenet 10-way (5-shot)	Accuracy	47.6	Reptile+BN
Few-Shot Image Classification	Mini-Imagenet 10-way (5-shot)	Accuracy	44.7	Reptile
Few-Shot Image Classification	OMNIGLOT - 1-Shot, 5-way	Accuracy	97.68	Reptile + Transduction
Few-Shot Image Classification	OMNIGLOT - 5-Shot, 5-way	Accuracy	99.48	Reptile + Transduction
Few-Shot Image Classification	Mini-Imagenet 5-way (5-shot)	Accuracy	65.99	Reptile + Transduction
Few-Shot Image Classification	Mini-Imagenet 5-way (1-shot)	Accuracy	49.97	Reptile + Transduction
Few-Shot Image Classification	Mini-Imagenet 10-way (1-shot)	Accuracy	32	Reptile+BN
Few-Shot Image Classification	Mini-Imagenet 10-way (1-shot)	Accuracy	31.1	Reptile
Few-Shot Image Classification	Tiered ImageNet 10-way (5-shot)	Accuracy	52	Reptile+BN
Few-Shot Image Classification	Tiered ImageNet 10-way (5-shot)	Accuracy	48	Reptile

On First-Order Meta-Learning Algorithms

Abstract

Results

Related Papers

On First-Order Meta-Learning Algorithms

Abstract

Results

Related Papers