TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Polyp Segmentation: A Deep Learning Perspective

Video Polyp Segmentation: A Deep Learning Perspective

Ge-Peng Ji, Guobao Xiao, Yu-Cheng Chou, Deng-Ping Fan, Kai Zhao, Geng Chen, Luc van Gool

2022-03-27Video Polyp SegmentationAttributeSegmentationSemantic SegmentationVideo Object SegmentationDeep LearningVideo Object Tracking
PaperPDFCodeCode(official)Code(official)Code(official)

Abstract

We present the first comprehensive video polyp segmentation (VPS) study in the deep learning era. Over the years, developments in VPS are not moving forward with ease due to the lack of large-scale fine-grained segmentation annotations. To address this issue, we first introduce a high-quality frame-by-frame annotated VPS dataset, named SUN-SEG, which contains 158,690 colonoscopy frames from the well-known SUN-database. We provide additional annotations with diverse types, i.e., attribute, object mask, boundary, scribble, and polygon. Second, we design a simple but efficient baseline, dubbed PNS+, consisting of a global encoder, a local encoder, and normalized self-attention (NS) blocks. The global and local encoders receive an anchor frame and multiple successive frames to extract long-term and short-term spatial-temporal representations, which are then progressively updated by two NS blocks. Extensive experiments show that PNS+ achieves the best performance and real-time inference speed (170fps), making it a promising solution for the VPS task. Third, we extensively evaluate 13 representative polyp/object segmentation models on our SUN-SEG dataset and provide attribute-based comparisons. Finally, we discuss several open issues and suggest possible research directions for the VPS community.

Results

TaskDatasetMetricValueModel
Medical Image SegmentationSUN-SEG-Easy (Unseen)Dice0.756PNS+
Medical Image SegmentationSUN-SEG-Easy (Unseen)S measure0.806PNS+
Medical Image SegmentationSUN-SEG-Easy (Unseen)Sensitivity0.63PNS+
Medical Image SegmentationSUN-SEG-Easy (Unseen)mean E-measure0.798PNS+
Medical Image SegmentationSUN-SEG-Easy (Unseen)mean F-measure0.73PNS+
Medical Image SegmentationSUN-SEG-Easy (Unseen)weighted F-measure0.676PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)Dice0.737PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)S-Measure0.797PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)Sensitivity0.623PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)mean E-measure0.793PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)mean F-measure0.709PNS+
Medical Image SegmentationSUN-SEG-Hard (Unseen)weighted F-measure0.653PNS+

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17