TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Large-scale Unsupervised Semantic Segmentation

Large-scale Unsupervised Semantic Segmentation

ShangHua Gao, Zhong-Yu Li, Ming-Hsuan Yang, Ming-Ming Cheng, Junwei Han, Philip Torr

2021-06-06Representation LearningUnsupervised Semantic SegmentationSegmentationSemantic Segmentation
PaperPDFCode(official)CodeCode

Abstract

Empowered by large datasets, e.g., ImageNet, unsupervised learning on large-scale data has enabled significant advances for classification tasks. However, whether the large-scale unsupervised semantic segmentation can be achieved remains unknown. There are two major challenges: i) we need a large-scale benchmark for assessing algorithms; ii) we need to develop methods to simultaneously learn category and shape representation in an unsupervised manner. In this work, we propose a new problem of large-scale unsupervised semantic segmentation (LUSS) with a newly created benchmark dataset to help the research progress. Building on the ImageNet dataset, we propose the ImageNet-S dataset with 1.2 million training images and 50k high-quality semantic segmentation annotations for evaluation. Our benchmark has a high data diversity and a clear task objective. We also present a simple yet effective method that works surprisingly well for LUSS. In addition, we benchmark related un/weakly/fully supervised methods accordingly, identifying the challenges and possible directions of LUSS. The benchmark and source code is publicly available at https://github.com/LUSSeg.

Results

TaskDatasetMetricValueModel
Semantic SegmentationImageNet-SmIoU (test)20.8PASS (ResNet-50 D16, 224x224, LUSS)
Semantic SegmentationImageNet-SmIoU (val)21.6PASS (ResNet-50 D16, 224x224, LUSS)
Semantic SegmentationImageNet-SmIoU (test)20.3PASS (ResNet-50 D32, 224x224, LUSS)
Semantic SegmentationImageNet-SmIoU (val)21PASS (ResNet-50 D32, 224x224, LUSS)
Semantic SegmentationImageNet-SmIoU (test)11PASS
Semantic SegmentationImageNet-SmIoU (val)11.5PASS
Semantic SegmentationImageNet-S-50mIoU (test)42.3PASS (+Saliency map)
Semantic SegmentationImageNet-S-50mIoU (val)43.3PASS (+Saliency map)
Semantic SegmentationImageNet-S-50mIoU (test)32PASS
Semantic SegmentationImageNet-S-50mIoU (val)32.4PASS
Semantic SegmentationImageNet-S-300mIoU (test)18.1PASS
Semantic SegmentationImageNet-S-300mIoU (val)18PASS
Unsupervised Semantic SegmentationImageNet-SmIoU (test)11PASS
Unsupervised Semantic SegmentationImageNet-SmIoU (val)11.5PASS
Unsupervised Semantic SegmentationImageNet-S-50mIoU (test)42.3PASS (+Saliency map)
Unsupervised Semantic SegmentationImageNet-S-50mIoU (val)43.3PASS (+Saliency map)
Unsupervised Semantic SegmentationImageNet-S-50mIoU (test)32PASS
Unsupervised Semantic SegmentationImageNet-S-50mIoU (val)32.4PASS
Unsupervised Semantic SegmentationImageNet-S-300mIoU (test)18.1PASS
Unsupervised Semantic SegmentationImageNet-S-300mIoU (val)18PASS
10-shot image generationImageNet-SmIoU (test)20.8PASS (ResNet-50 D16, 224x224, LUSS)
10-shot image generationImageNet-SmIoU (val)21.6PASS (ResNet-50 D16, 224x224, LUSS)
10-shot image generationImageNet-SmIoU (test)20.3PASS (ResNet-50 D32, 224x224, LUSS)
10-shot image generationImageNet-SmIoU (val)21PASS (ResNet-50 D32, 224x224, LUSS)
10-shot image generationImageNet-SmIoU (test)11PASS
10-shot image generationImageNet-SmIoU (val)11.5PASS
10-shot image generationImageNet-S-50mIoU (test)42.3PASS (+Saliency map)
10-shot image generationImageNet-S-50mIoU (val)43.3PASS (+Saliency map)
10-shot image generationImageNet-S-50mIoU (test)32PASS
10-shot image generationImageNet-S-50mIoU (val)32.4PASS
10-shot image generationImageNet-S-300mIoU (test)18.1PASS
10-shot image generationImageNet-S-300mIoU (val)18PASS

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17