TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/K-Net: Towards Unified Image Segmentation

K-Net: Towards Unified Image Segmentation

Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy

2021-06-28NeurIPS 2021 12Panoptic SegmentationSegmentationSemantic SegmentationInstance SegmentationImage Segmentation
PaperPDFCode(official)

Abstract

Semantic, instance, and panoptic segmentations have been addressed using different and specialized frameworks despite their underlying connections. This paper presents a unified, simple, and effective framework for these essentially similar tasks. The framework, named K-Net, segments both instances and semantic categories consistently by a group of learnable kernels, where each kernel is responsible for generating a mask for either a potential instance or a stuff class. To remedy the difficulties of distinguishing various instances, we propose a kernel update strategy that enables each kernel dynamic and conditional on its meaningful group in the input image. K-Net can be trained in an end-to-end manner with bipartite matching, and its training and inference are naturally NMS-free and box-free. Without bells and whistles, K-Net surpasses all previous published state-of-the-art single-model results of panoptic segmentation on MS COCO test-dev split and semantic segmentation on ADE20K val split with 55.2% PQ and 54.3% mIoU, respectively. Its instance segmentation performance is also on par with Cascade Mask R-CNN on MS COCO with 60%-90% faster inference speeds. Code and models will be released at https://github.com/ZwwWayne/K-Net/.

Results

TaskDatasetMetricValueModel
Semantic SegmentationADE20K valmIoU54.3K-Net
Semantic SegmentationADE20KValidation mIoU54.3K-Net
Semantic SegmentationCOCO test-devPQ55.2K-Net (Swin-L)
Semantic SegmentationCOCO test-devPQst46.2K-Net (Swin-L)
Semantic SegmentationCOCO test-devPQth61.2K-Net (Swin-L)
Semantic SegmentationCOCO test-devPQ48.3K-Net (R101-FPN-DCN)
Semantic SegmentationCOCO test-devPQst39.7K-Net (R101-FPN-DCN)
Semantic SegmentationCOCO test-devPQth54K-Net (R101-FPN-DCN)
Instance SegmentationCOCO test-devAP5063.3K-Net-N256 (ResNet-101)
Instance SegmentationCOCO test-devAPL59K-Net-N256 (ResNet-101)
Instance SegmentationCOCO test-devAPM43.3K-Net-N256 (ResNet-101)
Instance SegmentationCOCO test-devAPS18.8K-Net-N256 (ResNet-101)
Instance SegmentationCOCO test-devAP5062.8K-Net (ResNet-101)
Instance SegmentationCOCO test-devAPL58.8K-Net (ResNet-101)
Instance SegmentationCOCO test-devAPM42.7K-Net (ResNet-101)
Instance SegmentationCOCO test-devAPS18.7K-Net (ResNet-101)
10-shot image generationADE20K valmIoU54.3K-Net
10-shot image generationADE20KValidation mIoU54.3K-Net
10-shot image generationCOCO test-devPQ55.2K-Net (Swin-L)
10-shot image generationCOCO test-devPQst46.2K-Net (Swin-L)
10-shot image generationCOCO test-devPQth61.2K-Net (Swin-L)
10-shot image generationCOCO test-devPQ48.3K-Net (R101-FPN-DCN)
10-shot image generationCOCO test-devPQst39.7K-Net (R101-FPN-DCN)
10-shot image generationCOCO test-devPQth54K-Net (R101-FPN-DCN)
Panoptic SegmentationCOCO test-devPQ55.2K-Net (Swin-L)
Panoptic SegmentationCOCO test-devPQst46.2K-Net (Swin-L)
Panoptic SegmentationCOCO test-devPQth61.2K-Net (Swin-L)
Panoptic SegmentationCOCO test-devPQ48.3K-Net (R101-FPN-DCN)
Panoptic SegmentationCOCO test-devPQst39.7K-Net (R101-FPN-DCN)
Panoptic SegmentationCOCO test-devPQth54K-Net (R101-FPN-DCN)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17