Iterative Few-shot Semantic Segmentation from Image Label Text

Haohan Wang, Liang Liu, Wuhao Zhang, Jiangning Zhang, Zhenye Gan, Yabiao Wang, Chengjie Wang, Haoqian Wang

2023-03-10Few-Shot Semantic Segmentation Semantic Segmentation Language Modelling

Abstract

Few-shot semantic segmentation aims to learn to segment unseen class objects with the guidance of only a few support images. Most previous methods rely on the pixel-level label of support images. In this paper, we focus on a more challenging setting, in which only the image-level labels are available. We propose a general framework to firstly generate coarse masks with the help of the powerful vision-language model CLIP, and then iteratively and mutually refine the mask predictions of support and query images. Extensive experiments on PASCAL-5i and COCO-20i datasets demonstrate that our method not only outperforms the state-of-the-art weakly supervised approaches by a significant margin, but also achieves comparable or better results to recent supervised methods. Moreover, our method owns an excellent generalization ability for the images in the wild and uncommon classes. Code will be available at https://github.com/Whileherham/IMR-HSNet.

Results

Task	Dataset	Metric	Value	Model
Few-Shot Learning	COCO-20i (5-shot)	Mean IoU	44.4	IMR-HSNet (ResNet-50)
Few-Shot Learning	PASCAL-5i (1-Shot)	Mean IoU	61.1	IMR-HSNet (ResNet-50)
Few-Shot Learning	PASCAL-5i (1-Shot)	Mean IoU	56.5	IMR-HSNet (VGG-16)
Few-Shot Learning	COCO-20i (1-shot)	Mean IoU	42.4	IMR-HSNet (ResNet-50)
Few-Shot Learning	COCO-20i (1-shot)	Mean IoU	37.7	IIMR-HSNet (VGG-16)
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	Mean IoU	44.4	IMR-HSNet (ResNet-50)
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	Mean IoU	61.1	IMR-HSNet (ResNet-50)
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	Mean IoU	56.5	IMR-HSNet (VGG-16)
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	Mean IoU	42.4	IMR-HSNet (ResNet-50)
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	Mean IoU	37.7	IIMR-HSNet (VGG-16)
Meta-Learning	COCO-20i (5-shot)	Mean IoU	44.4	IMR-HSNet (ResNet-50)
Meta-Learning	PASCAL-5i (1-Shot)	Mean IoU	61.1	IMR-HSNet (ResNet-50)
Meta-Learning	PASCAL-5i (1-Shot)	Mean IoU	56.5	IMR-HSNet (VGG-16)
Meta-Learning	COCO-20i (1-shot)	Mean IoU	42.4	IMR-HSNet (ResNet-50)
Meta-Learning	COCO-20i (1-shot)	Mean IoU	37.7	IIMR-HSNet (VGG-16)

Iterative Few-shot Semantic Segmentation from Image Label Text

Abstract

Results

Related Papers

Iterative Few-shot Semantic Segmentation from Image Label Text

Abstract

Results

Related Papers