Chang-Bin Zhang, Jia-Wen Xiao, Xialei Liu, Ying-Cong Chen, Ming-Ming Cheng
In this work, we study the continual semantic segmentation problem, where the deep neural networks are required to incorporate new classes continually without catastrophic forgetting. We propose to use a structural re-parameterization mechanism, named representation compensation (RC) module, to decouple the representation learning of both old and new knowledge. The RC module consists of two dynamically evolved branches with one frozen and one trainable. Besides, we design a pooled cube knowledge distillation strategy on both spatial and channel dimensions to further enhance the plasticity and stability of the model. We conduct experiments on two challenging continual semantic segmentation scenarios, continual class segmentation and continual domain segmentation. Without any extra computational overhead and parameters during inference, our method outperforms state-of-the-art performance. The code is available at \url{https://github.com/zhangchbin/RCIL}.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 34.3 | RCNet-101 |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 72.4 | RCNet-101 |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| Semantic Segmentation | ADE20K | mIoU | 29.6 | RCNet-101 |
| Semantic Segmentation | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| Semantic Segmentation | ADE20K | mIoU | 34.5 | RCNet-101 |
| Semantic Segmentation | ADE20K | mIoU | 32.5 | RCNet-101 |
| Semantic Segmentation | ADE20K | Mean IoU (test) | 32.1 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | mIoU | 34.3 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | Mean IoU (val) | 72.4 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| Continual Learning | ADE20K | mIoU | 29.6 | RCNet-101 |
| Continual Learning | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| Continual Learning | ADE20K | mIoU | 34.5 | RCNet-101 |
| Continual Learning | ADE20K | mIoU | 32.5 | RCNet-101 |
| Continual Learning | ADE20K | Mean IoU (test) | 32.1 | RCNet-101 |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| 2D Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| 2D Semantic Segmentation | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 34.3 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU (val) | 72.4 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| Class Incremental Learning | ADE20K | mIoU | 29.6 | RCNet-101 |
| Class Incremental Learning | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| Class Incremental Learning | ADE20K | mIoU | 34.5 | RCNet-101 |
| Class Incremental Learning | ADE20K | mIoU | 32.5 | RCNet-101 |
| Class Incremental Learning | ADE20K | Mean IoU (test) | 32.1 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 34.3 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU (val) | 72.4 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 29.6 | RCNet-101 |
| Class-Incremental Semantic Segmentation | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 34.5 | RCNet-101 |
| Class-Incremental Semantic Segmentation | ADE20K | mIoU | 32.5 | RCNet-101 |
| Class-Incremental Semantic Segmentation | ADE20K | Mean IoU (test) | 32.1 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 34.3 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU (val) | 72.4 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 59.4 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 54.7 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | Mean IoU | 67.3 | RCNet-101 |
| 10-shot image generation | ADE20K | mIoU | 29.6 | RCNet-101 |
| 10-shot image generation | PASCAL VOC 2012 | mIoU | 18.2 | RCNet-101 |
| 10-shot image generation | ADE20K | mIoU | 34.5 | RCNet-101 |
| 10-shot image generation | ADE20K | mIoU | 32.5 | RCNet-101 |
| 10-shot image generation | ADE20K | Mean IoU (test) | 32.1 | RCNet-101 |