Overcoming catastrophic forgetting in neural networks

James Kirkpatrick, Razvan Pascanu, Neil Rabinowitz, Joel Veness, Guillaume Desjardins, Andrei A. Rusu, Kieran Milan, John Quan, Tiago Ramalho, Agnieszka Grabska-Barwinska, Demis Hassabis, Claudia Clopath, Dharshan Kumaran, Raia Hadsell

2016-12-02Continual Learning Class Incremental Learning Atari Games class-incremental learning General Classification Incremental Learning

Paper PDF Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code

Abstract

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable feature of connectionist models. We show that it is possible to overcome this limitation and train networks that can maintain expertise on tasks which they have not experienced for a long time. Our approach remembers old tasks by selectively slowing down learning on the weights important for those tasks. We demonstrate our approach is scalable and effective by solving a set of classification tasks based on the MNIST hand written digit dataset and by learning several Atari 2600 games sequentially.

Results

Task	Dataset	Metric	Value	Model
Continual Learning	20Newsgroup (10 tasks)	F1 - macro	0.918	EWC
Continual Learning	F-CelebA (10 tasks)	Acc	0.6545	EWC
Continual Learning	ASC (19 tasks)	F1 - macro	0.7452	EWC
Continual Learning	ASC (19 tasks)	F1 - macro	0.5243	L2
Continual Learning	DSC (10 tasks)	F1 - macro	0.6576	EWC
class-incremental learning	cifar100	10-stage average accuracy	50.53	EWC

Related Papers

RegCL: Continual Adaptation of Segment Anything Model via Model Merging2025-07-16 Information-Theoretic Generalization Bounds of Replay-based Continual Learning2025-07-16 PROL : Rehearsal Free Continual Learning in Streaming Data via Prompt Online Learning2025-07-16 Fast Last-Iterate Convergence of SGD in the Smooth Interpolation Regime2025-07-15 A Neural Network Model of Complementary Learning Systems: Pattern Separation and Completion for Continual Learning2025-07-15 LifelongPR: Lifelong knowledge fusion for point cloud place recognition based on replay and prompt learning2025-07-14 Overcoming catastrophic forgetting in neural networks2025-07-14 Continual Reinforcement Learning by Planning with Online World Models2025-07-12