Lorenzo Porzi, Samuel Rota Bulò, Aleksander Colovic, Peter Kontschieder
In this work we introduce a novel CNN-based architecture that can be trained end-to-end to deliver seamless scene segmentation results. Our goal is to predict consistent semantic segmentation and detection results by means of a panoptic output format, going beyond the simple combination of independently trained segmentation and detection models. The proposed architecture takes advantage of a novel segmentation head that seamlessly integrates multi-scale features generated by a Feature Pyramid Network with contextual information conveyed by a lightweight, DeepLab-like module. As an additional contribution, we review the panoptic metric and propose an alternative that overcomes its limitations when evaluating non-instance categories. Our proposed network architecture yields state-of-the-art results on three challenging street-level datasets, i.e., Cityscapes, the Indian Driving Dataset, and Mapillary Vistas.
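The panoptic quality (PQ) metric that the abstract revisits averages the IoU of matched segments while penalizing unmatched predictions and ground-truth segments. A minimal sketch of the standard formula follows; the function name and inputs are illustrative, not from the paper's code:

```python
# Sketch of the standard Panoptic Quality (PQ) metric.
# Predicted and ground-truth segments are matched (conventionally at IoU > 0.5);
# PQ = (sum of matched IoUs) / (TP + FP/2 + FN/2).

def panoptic_quality(matched_ious, num_false_positives, num_false_negatives):
    """Compute PQ from the IoUs of matched (true-positive) segment pairs."""
    tp = len(matched_ious)
    denom = tp + 0.5 * num_false_positives + 0.5 * num_false_negatives
    if denom == 0:
        return 0.0
    return sum(matched_ious) / denom

# Example: three matched segments, one false positive, one false negative.
pq = panoptic_quality([0.9, 0.8, 0.7], 1, 1)  # (0.9+0.8+0.7) / (3+0.5+0.5) = 0.6
```

For non-instance ("stuff") categories, each class forms at most one segment per image, so a single segment just below the matching threshold counts as both a false positive and a false negative; this all-or-nothing behavior is the limitation the proposed alternative metric addresses.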
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Panoptic Segmentation | KITTI Panoptic Segmentation | PQ | 42.2 | Seamless |
| Panoptic Segmentation | Indian Driving Dataset | PQ | 48.5 | Seamless |