TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

Juncai Peng, Yi Liu, Shiyu Tang, Yuying Hao, Lutao Chu, Guowei Chen, Zewu Wu, Zeyu Chen, Zhiliang Yu, Yuning Du, Qingqing Dang, Baohua Lai, Qiwen Liu, Xiaoguang Hu, dianhai yu, Yanjun Ma

2022-04-06Real-Time Semantic SegmentationSegmentationSemantic Segmentation
PaperPDFCode(official)CodeCode

Abstract

Real-world applications have high demands for semantic segmentation methods. Although semantic segmentation has made remarkable leap-forwards with deep learning, the performance of real-time methods is not satisfactory. In this work, we propose PP-LiteSeg, a novel lightweight model for the real-time semantic segmentation task. Specifically, we present a Flexible and Lightweight Decoder (FLD) to reduce computation overhead of previous decoder. To strengthen feature representations, we propose a Unified Attention Fusion Module (UAFM), which takes advantage of spatial and channel attention to produce a weight and then fuses the input features with the weight. Moreover, a Simple Pyramid Pooling Module (SPPM) is proposed to aggregate global context with low computation cost. Extensive evaluations demonstrate that PP-LiteSeg achieves a superior trade-off between accuracy and speed compared to other methods. On the Cityscapes test set, PP-LiteSeg achieves 72.0% mIoU/273.6 FPS and 77.5% mIoU/102.6 FPS on NVIDIA GTX 1080Ti. Source code and models are available at PaddleSeg: https://github.com/PaddlePaddle/PaddleSeg.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCamVidFrame (fps)154.8PP-LiteSeg-B
Semantic SegmentationCamVidmIoU75PP-LiteSeg-B
Semantic SegmentationCamVidFrame (fps)222.3PP-LiteSeg-T
Semantic SegmentationCamVidmIoU73.3PP-LiteSeg-T
Semantic SegmentationCityscapes valmIoU78.2PP-LiteSeg-B2
Semantic SegmentationCityscapes valmIoU76PP-LiteSeg-T2
Semantic SegmentationCityscapes valmIoU75.3PP-LiteSeg-B1
Semantic SegmentationCityscapes valmIoU73.1PP-LiteSeg-T1
10-shot image generationCamVidFrame (fps)154.8PP-LiteSeg-B
10-shot image generationCamVidmIoU75PP-LiteSeg-B
10-shot image generationCamVidFrame (fps)222.3PP-LiteSeg-T
10-shot image generationCamVidmIoU73.3PP-LiteSeg-T
10-shot image generationCityscapes valmIoU78.2PP-LiteSeg-B2
10-shot image generationCityscapes valmIoU76PP-LiteSeg-T2
10-shot image generationCityscapes valmIoU75.3PP-LiteSeg-B1
10-shot image generationCityscapes valmIoU73.1PP-LiteSeg-T1

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17