Autoregressive Unsupervised Image Segmentation

Yassine Ouali, Céline Hudelot, Myriam Tami

2020-07-16ECCV 2020 8Unsupervised Image Segmentation Representation Learning Unsupervised Semantic Segmentation Segmentation Semantic Segmentation Clustering Image Segmentation

Paper PDF Code

Abstract

In this work, we propose a new unsupervised image segmentation approach based on mutual information maximization between different constructed views of the inputs. Taking inspiration from autoregressive generative models that predict the current pixel from past pixels in a raster-scan ordering created with masked convolutions, we propose to use different orderings over the inputs using various forms of masked convolutions to construct different views of the data. For a given input, the model produces a pair of predictions with two valid orderings, and is then trained to maximize the mutual information between the two outputs. These outputs can either be low-dimensional features for representation learning or output clusters corresponding to semantic labels for clustering. While masked convolutions are used during training, in inference, no masking is applied and we fall back to the standard convolution where the model has access to the full input. The proposed method outperforms current state-of-the-art on unsupervised image segmentation. It is simple and easy to implement, and can be extended to other visual tasks and integrated seamlessly into existing unsupervised learning methods requiring different views of the data.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	COCO-Stuff-3	Pixel Accuracy	72.9	AC
Unsupervised Semantic Segmentation	COCO-Stuff-3	Pixel Accuracy	72.9	AC
10-shot image generation	COCO-Stuff-3	Pixel Accuracy	72.9	AC

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21 Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20 Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18 Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17 Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17 Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17 DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17 From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17