Puzzle-CAM: Improved localization via matching partial and full features

Sanghyun Jo, In-Jae Yu

2021-01-27Weakly-Supervised Semantic Segmentation Weakly supervised Semantic Segmentation Segmentation Semantic Segmentation

Paper PDF Code(official)Code Code Code

Abstract

Weakly-supervised semantic segmentation (WSSS) is introduced to narrow the gap for semantic segmentation performance from pixel-level supervision to image-level supervision. Most advanced approaches are based on class activation maps (CAMs) to generate pseudo-labels to train the segmentation network. The main limitation of WSSS is that the process of generating pseudo-labels from CAMs that use an image classifier is mainly focused on the most discriminative parts of the objects. To address this issue, we propose Puzzle-CAM, a process that minimizes differences between the features from separate patches and the whole image. Our method consists of a puzzle module and two regularization terms to discover the most integrated region in an object. Puzzle-CAM can activate the overall region of an object using image-level supervision without requiring extra parameters. % In experiments, Puzzle-CAM outperformed previous state-of-the-art methods using the same labels for supervision on the PASCAL VOC 2012 test dataset. In experiments, Puzzle-CAM outperformed previous state-of-the-art methods using the same labels for supervision on the PASCAL VOC 2012 dataset. Code associated with our experiments is available at https://github.com/OFRIN/PuzzleCAM.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	PASCAL VOC 2012 val	Mean IoU	71.9	Puzzle-CAM (ResNeSt-269)
Semantic Segmentation	PASCAL VOC 2012 val	Mean IoU	66.9	Puzzle-CAM (ResNeSt-101)
Semantic Segmentation	PASCAL VOC 2012 test	Mean IoU	72.2	Puzzle-CAM (ResNeSt-269)
Semantic Segmentation	PASCAL VOC 2012 test	Mean IoU	67.7	Puzzle-CAM (ResNeSt-101)
10-shot image generation	PASCAL VOC 2012 val	Mean IoU	71.9	Puzzle-CAM (ResNeSt-269)
10-shot image generation	PASCAL VOC 2012 val	Mean IoU	66.9	Puzzle-CAM (ResNeSt-101)
10-shot image generation	PASCAL VOC 2012 test	Mean IoU	72.2	Puzzle-CAM (ResNeSt-269)
10-shot image generation	PASCAL VOC 2012 test	Mean IoU	67.7	Puzzle-CAM (ResNeSt-101)

Puzzle-CAM: Improved localization via matching partial and full features

Abstract

Results

Related Papers

Puzzle-CAM: Improved localization via matching partial and full features

Abstract

Results

Related Papers