TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PIDNet: A Real-time Semantic Segmentation Network Inspired...

PIDNet: A Real-time Semantic Segmentation Network Inspired by PID Controllers

Jiacong Xu, Zixiang Xiong, Shankar P. Bhattacharyya

2022-06-04CVPR 2023 1Real-Time Semantic SegmentationSemantic Segmentation
PaperPDFCodeCodeCodeCodeCodeCode(official)

Abstract

Two-branch network architecture has shown its efficiency and effectiveness in real-time semantic segmentation tasks. However, direct fusion of high-resolution details and low-frequency context has the drawback of detailed features being easily overwhelmed by surrounding contextual information. This overshoot phenomenon limits the improvement of the segmentation accuracy of existing two-branch models. In this paper, we make a connection between Convolutional Neural Networks (CNN) and Proportional-Integral-Derivative (PID) controllers and reveal that a two-branch network is equivalent to a Proportional-Integral (PI) controller, which inherently suffers from similar overshoot issues. To alleviate this problem, we propose a novel three-branch network architecture: PIDNet, which contains three branches to parse detailed, context and boundary information, respectively, and employs boundary attention to guide the fusion of detailed and context branches. Our family of PIDNets achieve the best trade-off between inference speed and accuracy and their accuracy surpasses all the existing models with similar inference speed on the Cityscapes and CamVid datasets. Specifically, PIDNet-S achieves 78.6% mIOU with inference speed of 93.2 FPS on Cityscapes and 80.1% mIOU with speed of 153.7 FPS on CamVid.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapes testTime (ms)32.2PIDNet-L
Semantic SegmentationCityscapes testTime (ms)23.7PIDNet-M
Semantic SegmentationCityscapes testTime (ms)10.7PIDNet-S
Semantic SegmentationCamVidTime (ms)11.7PIDNet-M (Cityscapes-Pretrained)
Semantic SegmentationCamVidmIoU82PIDNet-M (Cityscapes-Pretrained)
Semantic SegmentationCamVidTime (ms)6.5PIDNet-S (Cityscapes-Pretrained)
Semantic SegmentationCamVidmIoU80.1PIDNet-S (Cityscapes-Pretrained)
Semantic SegmentationCityscapes valTime (ms)32.2PIDNet-L
Semantic SegmentationCityscapes valTime (ms)23.7PIDNet-M
Semantic SegmentationCityscapes valTime (ms)10.7PIDNet-S
10-shot image generationCityscapes testTime (ms)32.2PIDNet-L
10-shot image generationCityscapes testTime (ms)23.7PIDNet-M
10-shot image generationCityscapes testTime (ms)10.7PIDNet-S
10-shot image generationCamVidTime (ms)11.7PIDNet-M (Cityscapes-Pretrained)
10-shot image generationCamVidmIoU82PIDNet-M (Cityscapes-Pretrained)
10-shot image generationCamVidTime (ms)6.5PIDNet-S (Cityscapes-Pretrained)
10-shot image generationCamVidmIoU80.1PIDNet-S (Cityscapes-Pretrained)
10-shot image generationCityscapes valTime (ms)32.2PIDNet-L
10-shot image generationCityscapes valTime (ms)23.7PIDNet-M
10-shot image generationCityscapes valTime (ms)10.7PIDNet-S

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15