Umberto Michieli, Pietro Zanuttigh
Deep neural networks suffer from the major limitation of catastrophic forgetting old tasks when learning new ones. In this paper we focus on class incremental continual learning in semantic segmentation, where new categories are made available over time while previous training data is not retained. The proposed continual learning scheme shapes the latent space to reduce forgetting whilst improving the recognition of novel classes. Our framework is driven by three novel components which we also combine on top of existing techniques effortlessly. First, prototypes matching enforces latent space consistency on old classes, constraining the encoder to produce similar latent representation for previously seen classes in the subsequent steps. Second, features sparsification allows to make room in the latent space to accommodate novel classes. Finally, contrastive learning is employed to cluster features according to their semantics while tearing apart those of different classes. Extensive evaluation on the Pascal VOC2012 and ADE20K datasets demonstrates the effectiveness of our approach, significantly outperforming state-of-the-art methods.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 25.1 | SDR |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 70.1 | SDR |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 14.3 | SDR |
| Continual Learning | PASCAL VOC 2012 | mIoU | 25.1 | SDR |
| Continual Learning | PASCAL VOC 2012 | Mean IoU (val) | 70.1 | SDR |
| Continual Learning | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| Continual Learning | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| Continual Learning | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| Continual Learning | PASCAL VOC 2012 | mIoU | 14.3 | SDR |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| 2D Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 14.3 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 25.1 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU (val) | 70.1 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 14.3 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 25.1 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 70.1 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 14.3 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 25.1 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU (val) | 70.1 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 39.5 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 48.7 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU | 67.3 | SDR |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 14.3 | SDR |