PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

Jang Hyun Cho, Utkarsh Mall, Kavita Bala, Bharath Hariharan

2021-03-30CVPR 2021 1Unsupervised Semantic Segmentation Semantic Segmentation Clustering

Abstract

We present a new framework for semantic segmentation without annotations via clustering. Off-the-shelf clustering methods are limited to curated, single-label, and object-centric images yet real-world data are dominantly uncurated, multi-label, and scene-centric. We extend clustering from images to pixels and assign separate cluster membership to different instances within each image. However, solely relying on pixel-wise feature similarity fails to learn high-level semantic concepts and overfits to low-level visual cues. We propose a method to incorporate geometric consistency as an inductive bias to learn invariance and equivariance for photometric and geometric variations. With our novel learning objective, our framework can learn high-level semantic concepts. Our method, PiCIE (Pixel-level feature Clustering using Invariance and Equivariance), is the first method capable of segmenting both things and stuff categories without any hyperparameter tuning or task-specific pre-processing. Our method largely outperforms existing baselines on COCO and Cityscapes with +17.5 Acc. and +4.5 mIoU. We show that PiCIE gives a better initialization for standard supervised training. The code is available at https://github.com/janghyuncho/PiCIE.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	ImageNet-S-50	mIoU (test)	17.6	PiCIE (Supervised pretrain)
Semantic Segmentation	ImageNet-S-50	mIoU (val)	17.8	PiCIE (Supervised pretrain)
Semantic Segmentation	COCO-Stuff-171	Pixel Accuracy	29.8	PiCIE (ResNet-50)
Semantic Segmentation	COCO-Stuff-171	mIoU	5.6	PiCIE (ResNet-50)
Semantic Segmentation	Cityscapes test	Accuracy	65.5	PiCIE
Semantic Segmentation	Cityscapes test	mIoU	12.3	PiCIE
Semantic Segmentation	COCO-Stuff-27	Clustering [Accuracy]	49.99	PiCIE + H
Semantic Segmentation	COCO-Stuff-27	Clustering [mIoU]	14.36	PiCIE + H
Semantic Segmentation	COCO-Stuff-27	Clustering [Accuracy]	48.1	PiCIE
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (test)	17.6	PiCIE (Supervised pretrain)
Unsupervised Semantic Segmentation	ImageNet-S-50	mIoU (val)	17.8	PiCIE (Supervised pretrain)
Unsupervised Semantic Segmentation	COCO-Stuff-171	Pixel Accuracy	29.8	PiCIE (ResNet-50)
Unsupervised Semantic Segmentation	COCO-Stuff-171	mIoU	5.6	PiCIE (ResNet-50)
Unsupervised Semantic Segmentation	Cityscapes test	Accuracy	65.5	PiCIE
Unsupervised Semantic Segmentation	Cityscapes test	mIoU	12.3	PiCIE
Unsupervised Semantic Segmentation	COCO-Stuff-27	Clustering [Accuracy]	49.99	PiCIE + H
Unsupervised Semantic Segmentation	COCO-Stuff-27	Clustering [mIoU]	14.36	PiCIE + H
Unsupervised Semantic Segmentation	COCO-Stuff-27	Clustering [Accuracy]	48.1	PiCIE
10-shot image generation	ImageNet-S-50	mIoU (test)	17.6	PiCIE (Supervised pretrain)
10-shot image generation	ImageNet-S-50	mIoU (val)	17.8	PiCIE (Supervised pretrain)
10-shot image generation	COCO-Stuff-171	Pixel Accuracy	29.8	PiCIE (ResNet-50)
10-shot image generation	COCO-Stuff-171	mIoU	5.6	PiCIE (ResNet-50)
10-shot image generation	Cityscapes test	Accuracy	65.5	PiCIE
10-shot image generation	Cityscapes test	mIoU	12.3	PiCIE
10-shot image generation	COCO-Stuff-27	Clustering [Accuracy]	49.99	PiCIE + H
10-shot image generation	COCO-Stuff-27	Clustering [mIoU]	14.36	PiCIE + H
10-shot image generation	COCO-Stuff-27	Clustering [Accuracy]	48.1	PiCIE

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

Abstract

Results

Related Papers

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in Clustering

Abstract

Results

Related Papers