Fabio Cermelli, Massimiliano Mancini, Samuel Rota Bulò, Elisa Ricci, Barbara Caputo
Despite their effectiveness in a wide range of tasks, deep architectures suffer from some important limitations. In particular, they are vulnerable to catastrophic forgetting, i.e. they perform poorly when they are required to update their model as new classes are available but the original training set is not retained. This paper addresses this problem in the context of semantic segmentation. Current strategies fail on this task because they do not consider a peculiar aspect of semantic segmentation: since each training step provides annotation only for a subset of all possible classes, pixels of the background class (i.e. pixels that do not belong to any other classes) exhibit a semantic distribution shift. In this work we revisit classical incremental learning methods, proposing a new distillation-based framework which explicitly accounts for this shift. Furthermore, we introduce a novel strategy to initialize classifier's parameters, thus preventing biased predictions toward the background class. We demonstrate the effectiveness of our approach with an extensive evaluation on the Pascal-VOC 2012 and ADE20K datasets, significantly outperforming state of the art incremental learning methods.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 20.1 | MiB |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 6.9 | MiB |
| Continual Learning | PASCAL VOC 2012 | mIoU | 20.1 | MiB |
| Continual Learning | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| Continual Learning | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| Continual Learning | PASCAL VOC 2012 | mIoU | 6.9 | MiB |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| 2D Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 6.9 | MiB |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 20.1 | MiB |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 6.9 | MiB |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 20.1 | MiB |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 6.9 | MiB |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 20.1 | MiB |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 39.9 | MiB |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU | 65.9 | MiB |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 6.9 | MiB |