PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model

Juncai Peng, Yi Liu, Shiyu Tang, Yuying Hao, Lutao Chu, Guowei Chen, Zewu Wu, Zeyu Chen, Zhiliang Yu, Yuning Du, Qingqing Dang, Baohua Lai, Qiwen Liu, Xiaoguang Hu, Dianhai Yu, Yanjun Ma

2022-04-06

Real-Time Semantic Segmentation, Segmentation, Semantic Segmentation


Abstract

Real-world applications place high demands on semantic segmentation methods. Although deep learning has driven remarkable progress in semantic segmentation, the performance of real-time methods remains unsatisfactory. In this work, we propose PP-LiteSeg, a novel lightweight model for real-time semantic segmentation. Specifically, we present a Flexible and Lightweight Decoder (FLD) to reduce the computation overhead of previous decoders. To strengthen feature representations, we propose a Unified Attention Fusion Module (UAFM), which uses spatial and channel attention to produce a weight and then fuses the input features with that weight. Moreover, a Simple Pyramid Pooling Module (SPPM) is proposed to aggregate global context at low computation cost. Extensive evaluations demonstrate that PP-LiteSeg achieves a superior trade-off between accuracy and speed compared to other methods. On the Cityscapes test set, PP-LiteSeg achieves 72.0% mIoU at 273.6 FPS and 77.5% mIoU at 102.6 FPS on an NVIDIA GTX 1080Ti. Source code and models are available at PaddleSeg: https://github.com/PaddlePaddle/PaddleSeg.
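
The fusion idea described for UAFM (produce a weight from attention, then fuse the input features with it) can be illustrated with a minimal NumPy sketch. This is not the PaddleSeg implementation. It assumes a spatial-attention variant in which the weight is derived from channel-wise mean and max statistics of both inputs, that the two feature maps already share the same shape, and that a hypothetical 4-element vector `w` stands in for a learned convolution. The function name `attention_fuse` is also made up for illustration.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def attention_fuse(f_high, f_low, w, b=0.0):
    """Fuse two (C, H, W) feature maps with a per-pixel weight alpha in (0, 1).

    ASSUMPTION: alpha comes from channel-wise mean/max statistics mixed by
    the hypothetical weights `w` (a stand-in for a learned convolution).
    """
    stats = np.stack([
        f_high.mean(axis=0), f_high.max(axis=0),
        f_low.mean(axis=0), f_low.max(axis=0),
    ])  # shape (4, H, W)
    alpha = sigmoid(np.tensordot(w, stats, axes=1) + b)  # shape (H, W)
    # alpha broadcasts over the channel axis of both inputs.
    fused = alpha * f_high + (1.0 - alpha) * f_low
    return fused, alpha


rng = np.random.default_rng(0)
f_high = rng.normal(size=(8, 4, 4))  # e.g. upsampled deep feature
f_low = rng.normal(size=(8, 4, 4))   # e.g. shallow feature
fused, alpha = attention_fuse(f_high, f_low, w=np.array([0.5, 0.5, -0.5, -0.5]))
print(fused.shape, float(alpha.min()), float(alpha.max()))
```

With `w` set to all zeros the weight is 0.5 everywhere and the fusion reduces to a plain average of the two inputs, which makes the role of the learned weight easy to see.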

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	CamVid	Frame (fps)	154.8	PP-LiteSeg-B
Semantic Segmentation	CamVid	mIoU	75	PP-LiteSeg-B
Semantic Segmentation	CamVid	Frame (fps)	222.3	PP-LiteSeg-T
Semantic Segmentation	CamVid	mIoU	73.3	PP-LiteSeg-T
Semantic Segmentation	Cityscapes val	mIoU	78.2	PP-LiteSeg-B2
Semantic Segmentation	Cityscapes val	mIoU	76	PP-LiteSeg-T2
Semantic Segmentation	Cityscapes val	mIoU	75.3	PP-LiteSeg-B1
Semantic Segmentation	Cityscapes val	mIoU	73.1	PP-LiteSeg-T1
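
For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how mean intersection-over-union is commonly computed from predicted and ground-truth label maps. It assumes integer class labels in `[0, num_classes)` and does not handle an ignore label, which real benchmark evaluations typically do. The helper name `mean_iou` is illustrative and is not taken from PaddleSeg.

```python
import numpy as np


def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target."""
    pred = np.asarray(pred).ravel()
    target = np.asarray(target).ravel()
    # Confusion matrix: rows are ground-truth classes, columns are predictions.
    conf = np.bincount(
        target * num_classes + pred, minlength=num_classes ** 2
    ).reshape(num_classes, num_classes)
    inter = np.diag(conf)
    union = conf.sum(axis=0) + conf.sum(axis=1) - inter
    valid = union > 0  # skip classes absent from both maps
    return float((inter[valid] / union[valid]).mean())


pred = np.array([0, 0, 1, 1])
target = np.array([0, 1, 1, 1])
print(mean_iou(pred, target, num_classes=2))  # per-class IoU is 1/2 and 2/3
```

The table reports this quantity as a percentage, so a value of 0.75 from this function would correspond to an entry of 75.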
