Large-scale Unsupervised Semantic Segmentation

ShangHua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip Torr

2021-06-06Representation Learning Unsupervised Semantic Segmentation Segmentation Semantic Segmentation

Abstract

Empowered by large datasets, e.g., ImageNet, unsupervised learning on large-scale data has enabled significant advances for classification tasks. However, whether the large-scale unsupervised semantic segmentation can be achieved remains unknown. There are two major challenges: i) we need a large-scale benchmark for assessing algorithms; ii) we need to develop methods to simultaneously learn category and shape representation in an unsupervised manner. In this work, we propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to help the research progress. Building on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 50k high-quality semantic segmentation annotations for evaluation. Our benchmark has a high data diversity and a clear task objective. We also present a simple yet effective method that works surprisingly well for LUSS. In addition, we benchmark related un/weakly/fully supervised methods accordingly, identifying the challenges and possible directions of LUSS. The benchmark and source code is publicly available at https://github.com/LUSSeg.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	ImageNet-S	mIoU (test)	20.8	PASS (ResNet-50 D16, 224x224, LUSS)
Semantic Segmentation	ImageNet-S	mIoU (val)	21.6	PASS (ResNet-50 D16, 224x224, LUSS)
Semantic Segmentation	ImageNet-S	mIoU (test)	20.3	PASS (ResNet-50 D32, 224x224, LUSS)
Semantic Segmentation	ImageNet-S	mIoU (val)	21	PASS (ResNet-50 D32, 224x224, LUSS)
Semantic Segmentation	ImageNet-S	mIoU (test)	11	PASS
Semantic Segmentation	ImageNet-S	mIoU (val)	11.5	PASS
Semantic Segmentation	ImageNet-S-50	mIoU (test)	42.3	PASS (+Saliency map)
Semantic Segmentation	ImageNet-S-50	mIoU (val)	43.3	PASS (+Saliency map)
Semantic Segmentation	ImageNet-S-50	mIoU (test)	32	PASS
Semantic Segmentation	ImageNet-S-50	mIoU (val)	32.4	PASS
Semantic Segmentation	ImageNet-S-300	mIoU (test)	18.1	PASS
Semantic Segmentation	ImageNet-S-300	mIoU (val)	18	PASS
Unsupervised Semantic Segmentation	ImageNet-S	mIoU (test)	11	PASS
Unsupervised Semantic Segmentation	ImageNet-S	mIoU (val)	11.5	PASS
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (test)	42.3	PASS (+Saliency map)
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (val)	43.3	PASS (+Saliency map)
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (test)	32	PASS
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (val)	32.4	PASS
Unsupervised Semantic Segmentation	ImageNet-S-300	mIoU (test)	18.1	PASS
Unsupervised Semantic Segmentation	ImageNet-S-300	mIoU (val)	18	PASS
10-shot image generation	ImageNet-S	mIoU (test)	20.8	PASS (ResNet-50 D16, 224x224, LUSS)
10-shot image generation	ImageNet-S	mIoU (val)	21.6	PASS (ResNet-50 D16, 224x224, LUSS)
10-shot image generation	ImageNet-S	mIoU (test)	20.3	PASS (ResNet-50 D32, 224x224, LUSS)
10-shot image generation	ImageNet-S	mIoU (val)	21	PASS (ResNet-50 D32, 224x224, LUSS)
10-shot image generation	ImageNet-S	mIoU (test)	11	PASS
10-shot image generation	ImageNet-S	mIoU (val)	11.5	PASS
10-shot image generation	ImageNet-S-50	mIoU (test)	42.3	PASS (+Saliency map)
10-shot image generation	ImageNet-S-50	mIoU (val)	43.3	PASS (+Saliency map)
10-shot image generation	ImageNet-S-50	mIoU (test)	32	PASS
10-shot image generation	ImageNet-S-50	mIoU (val)	32.4	PASS
10-shot image generation	ImageNet-S-300	mIoU (test)	18.1	PASS
10-shot image generation	ImageNet-S-300	mIoU (val)	18	PASS

Large-scale Unsupervised Semantic Segmentation

Abstract

Results

Related Papers

Large-scale Unsupervised Semantic Segmentation

Abstract

Results

Related Papers