Xiangyu He, Zitao Mo, Qiang Chen, Anda Cheng, Peisong Wang, Jian Cheng
Many successful learning targets such as minimizing dice loss and cross-entropy loss have enabled unprecedented breakthroughs in segmentation tasks. Beyond these semantic metrics, this paper aims to introduce location supervision into semantic segmentation. Based on this idea, we present a Location-aware Upsampling (LaU) that adaptively refines the interpolating coordinates with trainable offsets. Then, location-aware losses are established by encouraging pixels to move towards well-classified locations. An LaU is offset prediction coupled with interpolation, which is trained end-to-end to generate confidence score at each position from coarse to fine. Guided by location-aware losses, the new module can replace its plain counterpart (\textit{e.g.}, bilinear upsampling) in a plug-and-play manner to further boost the leading encoder-decoder approaches. Extensive experiments validate the consistent improvement over the state-of-the-art methods on benchmark datasets. Our code is available at https://github.com/HolmesShuan/Location-aware-Upsampling-for-Semantic-Segmentation
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ADE20K val | mIoU | 45.02 | LaU-regression-loss |
| Semantic Segmentation | PASCAL Context | mIoU | 53.9 | LaU-regression-loss (ResNet-101) |
| Semantic Segmentation | ADE20K | Test Score | 56.32 | LaU-regression-loss |
| Semantic Segmentation | ADE20K | Validation mIoU | 45.02 | LaU-regression-loss |
| Semantic Segmentation | ADE20K | Test Score | 56.41 | LaU-offset-loss |
| Semantic Segmentation | ADE20K | Validation mIoU | 44.55 | LaU-offset-loss |
| 10-shot image generation | ADE20K val | mIoU | 45.02 | LaU-regression-loss |
| 10-shot image generation | PASCAL Context | mIoU | 53.9 | LaU-regression-loss (ResNet-101) |
| 10-shot image generation | ADE20K | Test Score | 56.32 | LaU-regression-loss |
| 10-shot image generation | ADE20K | Validation mIoU | 45.02 | LaU-regression-loss |
| 10-shot image generation | ADE20K | Test Score | 56.41 | LaU-offset-loss |
| 10-shot image generation | ADE20K | Validation mIoU | 44.55 | LaU-offset-loss |