CrOC: Cross-View Online Clustering for Dense Visual Representation Learning

Thomas Stegmüller, Tim Lebailly, Behzad Bozorgtabar, Tinne Tuytelaars, Jean-Philippe Thiran

2023-03-23CVPR 2023 1Online Clustering Representation Learning Unsupervised Semantic Segmentation Segmentation Semantic Segmentation Clustering Video Object Segmentation Video Semantic Segmentation

Paper PDF Code Code(official)

Abstract

Learning dense visual representations without labels is an arduous task and more so from scene-centric data. We propose to tackle this challenging problem by proposing a Cross-view consistency objective with an Online Clustering mechanism (CrOC) to discover and segment the semantics of the views. In the absence of hand-crafted priors, the resulting method is more generalizable and does not require a cumbersome pre-processing step. More importantly, the clustering algorithm conjointly operates on the features of both views, thereby elegantly bypassing the issue of content not represented in both views and the ambiguous matching of objects from one crop to the other. We demonstrate excellent performance on linear and unsupervised segmentation transfer tasks on various datasets and similarly for video object segmentation. Our code and pre-trained models are publicly available at https://github.com/stegmuel/CrOC.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	COCO-Stuff-27	Clustering [mIoU]	21.9	CrOC (ViT-S/16, COCO+)
Unsupervised Semantic Segmentation	COCO-Stuff-27	Clustering [mIoU]	21.9	CrOC (ViT-S/16, COCO+)
10-shot image generation	COCO-Stuff-27	Clustering [mIoU]	21.9	CrOC (ViT-S/16, COCO+)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21 Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20 Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18 Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17 Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17 Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17 DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17 From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17