TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Intra-Batch Supervision for Panoptic Segmentation on High-...

Intra-Batch Supervision for Panoptic Segmentation on High-Resolution Images

Daan de Geus, Gijs Dubbelman

2023-04-17IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023 1Panoptic Segmentation
PaperPDFCodeCode

Abstract

Unified panoptic segmentation methods are achieving state-of-the-art results on several datasets. To achieve these results on high-resolution datasets, these methods apply crop-based training. In this work, we find that, although crop-based training is advantageous in general, it also has a harmful side-effect. Specifically, it limits the ability of unified networks to discriminate between large object instances, causing them to make predictions that are confused between multiple instances. To solve this, we propose Intra-Batch Supervision (IBS), which improves a network's ability to discriminate between instances by introducing additional supervision using multiple images from the same batch. We show that, with our IBS, we successfully address the confusion problem and consistently improve the performance of unified networks. For the high-resolution Cityscapes and Mapillary Vistas datasets, we achieve improvements of up to +2.5 on the Panoptic Quality for thing classes, and even more considerable gains of up to +5.8 on both the pixel accuracy and pixel precision, which we identify as better metrics to capture the confusion problem.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapes valPQ62.4Mask2Former + Intra-Batch Supervision (ResNet-50)
Semantic SegmentationCityscapes valPQst67.3Mask2Former + Intra-Batch Supervision (ResNet-50)
Semantic SegmentationCityscapes valPQth54.7Mask2Former + Intra-Batch Supervision (ResNet-50)
Semantic SegmentationMapillary valPQ42.2Mask2Former + Intra-Batch Supervision (ResNet-50)
Semantic SegmentationMapillary valPQst52Mask2Former + Intra-Batch Supervision (ResNet-50)
Semantic SegmentationMapillary valPQth34.9Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationCityscapes valPQ62.4Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationCityscapes valPQst67.3Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationCityscapes valPQth54.7Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationMapillary valPQ42.2Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationMapillary valPQst52Mask2Former + Intra-Batch Supervision (ResNet-50)
10-shot image generationMapillary valPQth34.9Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationCityscapes valPQ62.4Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationCityscapes valPQst67.3Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationCityscapes valPQth54.7Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationMapillary valPQ42.2Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationMapillary valPQst52Mask2Former + Intra-Batch Supervision (ResNet-50)
Panoptic SegmentationMapillary valPQth34.9Mask2Former + Intra-Batch Supervision (ResNet-50)

Related Papers

DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation2025-07-14OpenWorldSAM: Extending SAM2 for Universal Image Segmentation with Language Prompts2025-07-07HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation2025-06-26PanSt3R: Multi-view Consistent Panoptic Segmentation2025-06-26Open-Set LiDAR Panoptic Segmentation Guided by Uncertainty-Aware Learning2025-06-16A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects2025-06-16The Missing Point in Vision Transformers for Universal Image Segmentation2025-05-26How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation2025-05-25