TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Drive&Segment: Unsupervised Semantic Segmentation of Urban...

Drive&Segment: Unsupervised Semantic Segmentation of Urban Scenes via Cross-modal Distillation

Antonin Vobecky, David Hurych, Oriane Siméoni, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Josef Sivic

2022-03-21Unsupervised Semantic SegmentationSegmentationSemantic SegmentationImage Segmentation
PaperPDFCode(official)

Abstract

This work investigates learning pixel-wise semantic image segmentation in urban scenes without any manual annotation, just from the raw non-curated data collected by cars which, equipped with cameras and LiDAR sensors, drive around a city. Our contributions are threefold. First, we propose a novel method for cross-modal unsupervised learning of semantic image segmentation by leveraging synchronized LiDAR and image data. The key ingredient of our method is the use of an object proposal module that analyzes the LiDAR point cloud to obtain proposals for spatially consistent objects. Second, we show that these 3D object proposals can be aligned with the input images and reliably clustered into semantically meaningful pseudo-classes. Finally, we develop a cross-modal distillation approach that leverages image data partially annotated with the resulting pseudo-classes to train a transformer-based model for image semantic segmentation. We show the generalization capabilities of our method by testing on four different testing datasets (Cityscapes, Dark Zurich, Nighttime Driving and ACDC) without any finetuning, and demonstrate significant improvements compared to the current state of the art on this problem. See project webpage https://vobecant.github.io/DriveAndSegment/ for the code and more.

Results

TaskDatasetMetricValueModel
Semantic SegmentationACDC (Adverse Conditions Dataset with Correspondences)mIoU16.7Segmenter ViT-S/16
Semantic SegmentationNighttime DrivingmIoU18.9Segmenter ViT-S/16
Semantic SegmentationDark ZurichmIoU14.2Segmenter ViT-S/16
Semantic SegmentationCityscapes valmIoU21.8Segmenter ViT-S/16
Unsupervised Semantic SegmentationACDC (Adverse Conditions Dataset with Correspondences)mIoU16.7Segmenter ViT-S/16
Unsupervised Semantic SegmentationNighttime DrivingmIoU18.9Segmenter ViT-S/16
Unsupervised Semantic SegmentationDark ZurichmIoU14.2Segmenter ViT-S/16
Unsupervised Semantic SegmentationCityscapes valmIoU21.8Segmenter ViT-S/16
10-shot image generationACDC (Adverse Conditions Dataset with Correspondences)mIoU16.7Segmenter ViT-S/16
10-shot image generationNighttime DrivingmIoU18.9Segmenter ViT-S/16
10-shot image generationDark ZurichmIoU14.2Segmenter ViT-S/16
10-shot image generationCityscapes valmIoU21.8Segmenter ViT-S/16

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17