Dipam Goswami, René Schuster, Joost Van de Weijer, Didier Stricker
In class-incremental semantic segmentation (CISS), deep learning architectures suffer from the critical problems of catastrophic forgetting and semantic background shift. Although recent works focused on these issues, existing classifier initialization methods do not address the background shift problem and assign the same initialization weights to both background and new foreground class classifiers. We propose to address the background shift with a novel classifier initialization method which employs gradient-based attribution to identify the most relevant weights for new classes from the classifier's weights for the previous background and transfers these weights to the new classifier. This warm-start weight initialization provides a general solution applicable to several CISS methods. Furthermore, it accelerates learning of new classes while mitigating forgetting. Our experiments demonstrate significant improvement in mIoU compared to the state-of-the-art CISS methods on the Pascal-VOC 2012, ADE20K and Cityscapes datasets.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | Cityscapes | mIoU | 44.9 | MiB+AWT |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 60.7 | SSUL+AWT |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 71.4 | SSUL+AWT |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| Semantic Segmentation | ADE20K | mIoU | 31.1 | MiB+AWT |
| Semantic Segmentation | ADE20K | mIoU | 35.6 | MiB+AWT |
| Semantic Segmentation | ADE20K | mIoU | 33.5 | MiB+AWT |
| Semantic Segmentation | ADE20K | Mean IoU (test) | 33.2 | MiB+AWT |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| Semantic Segmentation | Cityscapes | mIoU | 46.9 | MiB+AWT |
| Continual Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| Continual Learning | Cityscapes | mIoU | 44.9 | MiB+AWT |
| Continual Learning | PASCAL VOC 2012 | mIoU | 60.7 | SSUL+AWT |
| Continual Learning | PASCAL VOC 2012 | Mean IoU (val) | 71.4 | SSUL+AWT |
| Continual Learning | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| Continual Learning | ADE20K | mIoU | 31.1 | MiB+AWT |
| Continual Learning | ADE20K | mIoU | 35.6 | MiB+AWT |
| Continual Learning | ADE20K | mIoU | 33.5 | MiB+AWT |
| Continual Learning | ADE20K | Mean IoU (test) | 33.2 | MiB+AWT |
| Continual Learning | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| Continual Learning | Cityscapes | mIoU | 46.9 | MiB+AWT |
| 2D Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| Class Incremental Learning | Cityscapes | mIoU | 44.9 | MiB+AWT |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 60.7 | SSUL+AWT |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU (val) | 71.4 | SSUL+AWT |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| Class Incremental Learning | ADE20K | mIoU | 31.1 | MiB+AWT |
| Class Incremental Learning | ADE20K | mIoU | 35.6 | MiB+AWT |
| Class Incremental Learning | ADE20K | mIoU | 33.5 | MiB+AWT |
| Class Incremental Learning | ADE20K | Mean IoU (test) | 33.2 | MiB+AWT |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| Class Incremental Learning | Cityscapes | mIoU | 46.9 | MiB+AWT |
| Class-Incremental Semantic Segmentation | Cityscapes | mIoU | 44.9 | MiB+AWT |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 60.7 | SSUL+AWT |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 71.4 | SSUL+AWT |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 31.1 | MiB+AWT |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 35.6 | MiB+AWT |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 33.5 | MiB+AWT |
| Class-Incremental Semantic Segmentation | ADE20K | Mean IoU (test) | 33.2 | MiB+AWT |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| Class-Incremental Semantic Segmentation | Cityscapes | mIoU | 46.9 | MiB+AWT |
| 10-shot image generation | Cityscapes | mIoU | 44.9 | MiB+AWT |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 60.7 | SSUL+AWT |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU (val) | 71.4 | SSUL+AWT |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 67.6 | SSUL+AWT |
| 10-shot image generation | ADE20K | mIoU | 31.1 | MiB+AWT |
| 10-shot image generation | ADE20K | mIoU | 35.6 | MiB+AWT |
| 10-shot image generation | ADE20K | mIoU | 33.5 | MiB+AWT |
| 10-shot image generation | ADE20K | Mean IoU (test) | 33.2 | MiB+AWT |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU (test) | 57.1 | SSUL+AWT |
| 10-shot image generation | Cityscapes | mIoU | 46.9 | MiB+AWT |