TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TransFGU: A Top-down Approach to Fine-Grained Unsupervised...

TransFGU: A Top-down Approach to Fine-Grained Unsupervised Semantic Segmentation

Zhaoyuan Yin, Pichao Wang, Fan Wang, Xianzhe Xu, Hanling Zhang, Hao Li, Rong Jin

2021-12-02Self-Supervised LearningUnsupervised Semantic SegmentationSegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

Unsupervised semantic segmentation aims to obtain high-level semantic representation on low-level visual features without manual annotations. Most existing methods are bottom-up approaches that try to group pixels into regions based on their visual cues or certain predefined rules. As a result, it is difficult for these bottom-up approaches to generate fine-grained semantic segmentation when coming to complicated scenes with multiple objects and some objects sharing similar visual appearance. In contrast, we propose the first top-down unsupervised semantic segmentation framework for fine-grained segmentation in extremely complicated scenarios. Specifically, we first obtain rich high-level structured semantic concept information from large-scale vision data in a self-supervised learning manner, and use such information as a prior to discover potential semantic categories presented in target datasets. Secondly, the discovered high-level semantic categories are mapped to low-level pixel features by calculating the class activate map (CAM) with respect to certain discovered semantic representation. Lastly, the obtained CAMs serve as pseudo labels to train the segmentation module and produce the final semantic segmentation. Experimental results on multiple semantic segmentation benchmarks show that our top-down unsupervised segmentation is robust to both object-centric and scene-centric datasets under different semantic granularity levels, and outperforms all the current state-of-the-art bottom-up methods. Our code is available at \url{https://github.com/damo-cv/TransFGU}.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCOCO-Stuff-81Pixel Accuracy64.3TransFGU (ViT-S/8)
Semantic SegmentationCOCO-Stuff-81mIoU12.7TransFGU (ViT-S/8)
Semantic SegmentationCOCO-Stuff-171Pixel Accuracy34.32TransFGU (ViT-S/8)
Semantic SegmentationCOCO-Stuff-171mIoU11.93TransFGU (ViT-S/8)
Unsupervised Semantic SegmentationCOCO-Stuff-81Pixel Accuracy64.3TransFGU (ViT-S/8)
Unsupervised Semantic SegmentationCOCO-Stuff-81mIoU12.7TransFGU (ViT-S/8)
Unsupervised Semantic SegmentationCOCO-Stuff-171Pixel Accuracy34.32TransFGU (ViT-S/8)
Unsupervised Semantic SegmentationCOCO-Stuff-171mIoU11.93TransFGU (ViT-S/8)
10-shot image generationCOCO-Stuff-81Pixel Accuracy64.3TransFGU (ViT-S/8)
10-shot image generationCOCO-Stuff-81mIoU12.7TransFGU (ViT-S/8)
10-shot image generationCOCO-Stuff-171Pixel Accuracy34.32TransFGU (ViT-S/8)
10-shot image generationCOCO-Stuff-171mIoU11.93TransFGU (ViT-S/8)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17