TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Improving Semantic Segmentation via Video Propagation and ...

Improving Semantic Segmentation via Video Propagation and Label Relaxation

Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn Newsam, Andrew Tao, Bryan Catanzaro

2018-12-04CVPR 2019 6SegmentationSemantic Segmentation
PaperPDFCodeCodeCodeCodeCode

Abstract

Semantic segmentation requires large amounts of pixel-wise annotations to learn accurate models. In this paper, we present a video prediction-based methodology to scale up training sets by synthesizing new training samples in order to improve the accuracy of semantic segmentation networks. We exploit video prediction models' ability to predict future frames in order to also predict future labels. A joint propagation strategy is also proposed to alleviate mis-alignments in synthesized samples. We demonstrate that training segmentation models on datasets augmented by the synthesized samples leads to significant improvements in accuracy. Furthermore, we introduce a novel boundary label relaxation technique that makes training robust to annotation noise and propagation artifacts along object boundaries. Our proposed methods achieve state-of-the-art mIoUs of 83.5% on Cityscapes and 82.9% on CamVid. Our single model, without model ensembles, achieves 72.8% mIoU on the KITTI semantic segmentation test set, which surpasses the winning entry of the ROB challenge 2018. Our code and videos can be found at https://nv-adlr.github.io/publication/2018-Segmentation.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCamVidMean IoU81.7DeepLabV3Plus + SDCNetAug
Semantic SegmentationKITTI Semantic SegmentationCategory IoU88.99DeepLabV3Plus + SDCNetAug
Semantic SegmentationKITTI Semantic SegmentationCategory iIoU75.26DeepLabV3Plus + SDCNetAug
Semantic SegmentationKITTI Semantic SegmentationMean IoU (class)72.83DeepLabV3Plus + SDCNetAug
Semantic SegmentationKITTI Semantic Segmentationclass iIoU48.68DeepLabV3Plus + SDCNetAug
10-shot image generationCamVidMean IoU81.7DeepLabV3Plus + SDCNetAug
10-shot image generationKITTI Semantic SegmentationCategory IoU88.99DeepLabV3Plus + SDCNetAug
10-shot image generationKITTI Semantic SegmentationCategory iIoU75.26DeepLabV3Plus + SDCNetAug
10-shot image generationKITTI Semantic SegmentationMean IoU (class)72.83DeepLabV3Plus + SDCNetAug
10-shot image generationKITTI Semantic Segmentationclass iIoU48.68DeepLabV3Plus + SDCNetAug

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17