TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/RMM: Reinforced Memory Management for Class-Incremental Le...

RMM: Reinforced Memory Management for Class-Incremental Learning

Yaoyao Liu, Bernt Schiele, Qianru Sun

2023-01-14NeurIPS 2021 12Class Incremental LearningManagementclass-incremental learningIncremental Learning
PaperPDFCode(official)CodeCode(official)Code

Abstract

Class-Incremental Learning (CIL) [40] trains classifiers under a strict memory budget: in each incremental phase, learning is done for new data, most of which is abandoned to free space for the next phase. The preserved data are exemplars used for replaying. However, existing methods use a static and ad hoc strategy for memory allocation, which is often sub-optimal. In this work, we propose a dynamic memory management strategy that is optimized for the incremental phases and different object classes. We call our method reinforced memory management (RMM), leveraging reinforcement learning. RMM training is not naturally compatible with CIL as the past, and future data are strictly non-accessible during the incremental phases. We solve this by training the policy function of RMM on pseudo CIL tasks, e.g., the tasks built on the data of the 0-th phase, and then applying it to target tasks. RMM propagates two levels of actions: Level-1 determines how to split the memory between old and new classes, and Level-2 allocates memory for each specific class. In essence, it is an optimizable and general method for memory management that can be used in any replaying-based CIL method. For evaluation, we plug RMM into two top-performing baselines (LUCIR+AANets and POD+AANets [30]) and conduct experiments on three benchmarks (CIFAR-100, ImageNet-Subset, and ImageNet-Full). Our results show clear improvements, e.g., boosting POD+AANets by 3.6%, 4.4%, and 1.9% in the 25-Phase settings of the above benchmarks, respectively.

Results

TaskDatasetMetricValueModel
Incremental LearningCIFAR-100 - 50 classes + 10 steps of 5 classesAverage Incremental Accuracy67.61RMM (Modified ResNet-32)
Incremental LearningImageNet-100 - 50 classes + 25 steps of 2 classesAverage Incremental Accuracy76.54RMM (ResNet-18)
Incremental LearningCIFAR-100 - 50 classes + 5 steps of 10 classesAverage Incremental Accuracy68.86RMM (Modified ResNet-32)
Incremental LearningImageNet - 10 stepsAverage Incremental Accuracy67.45RMM (ResNet-18)
Incremental LearningImageNet - 500 classes + 5 steps of 100 classesAverage Incremental Accuracy69.21RMM (ResNet-18)
Incremental LearningImageNet100 - 10 stepsAverage Incremental Accuracy78.47RMM (ResNet-18)
Incremental LearningCIFAR-100 - 50 classes + 25 steps of 2 classesAverage Incremental Accuracy66.21RMM (Modified ResNet-32)
Incremental LearningImageNet - 500 classes + 25 steps of 20 classesAverage Incremental Accuracy63.93RMM (ResNet-18)
Incremental LearningImageNet - 500 classes + 10 steps of 50 classesAverage Incremental Accuracy67.45RMM (ResNet-18)
Incremental LearningImageNet-100 - 50 classes + 10 steps of 5 classesAverage Incremental Accuracy78.47RMM (ResNet-18)
Incremental LearningImageNet-100 - 50 classes + 5 steps of 10 classesAverage Incremental Accuracy79.52RMM (ResNet-18)

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17The Bayesian Approach to Continual Learning: An Overview2025-07-11Balancing the Past and Present: A Coordinated Replay Framework for Federated Class-Incremental Learning2025-07-10Unpatchable Vulnerabilities in Windows 10/11: Security Report 20252025-07-10DT4PCP: A Digital Twin Framework for Personalized Care Planning Applied to Type 2 Diabetes Management2025-07-10RAPS-3D: Efficient interactive segmentation for 3D radiological imaging2025-07-10Addressing Imbalanced Domain-Incremental Learning through Dual-Balance Collaborative Experts2025-07-09