TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/NanoNet: Real-Time Polyp Segmentation in Video Capsule End...

NanoNet: Real-Time Polyp Segmentation in Video Capsule Endoscopy and Colonoscopy

Debesh Jha, Nikhil Kumar Tomar, Sharib Ali, Michael A. Riegler, Håvard D. Johansen, Dag Johansen, Thomas de Lange, Pål Halvorsen

2021-04-22Colorectal Polyps CharacterizationReal-Time Semantic SegmentationSegmentationTransfer LearningSemantic SegmentationMedical Image SegmentationInstrument Recognition
PaperPDFCodeCodeCode(official)

Abstract

Deep learning in gastrointestinal endoscopy can assist to improve clinical performance and be helpful to assess lesions more accurately. To this extent, semantic segmentation methods that can perform automated real-time delineation of a region-of-interest, e.g., boundary identification of cancer or precancerous lesions, can benefit both diagnosis and interventions. However, accurate and real-time segmentation of endoscopic images is extremely challenging due to its high operator dependence and high-definition image quality. To utilize automated methods in clinical settings, it is crucial to design lightweight models with low latency such that they can be integrated with low-end endoscope hardware devices. In this work, we propose NanoNet, a novel architecture for the segmentation of video capsule endoscopy and colonoscopy images. Our proposed architecture allows real-time performance and has higher segmentation accuracy compared to other more complex ones. We use video capsule endoscopy and standard colonoscopy datasets with polyps, and a dataset consisting of endoscopy biopsies and surgical instruments, to evaluate the effectiveness of our approach. Our experiments demonstrate the increased performance of our architecture in terms of a trade-off between model complexity, speed, model parameters, and metric performances. Moreover, the resulting model size is relatively tiny, with only nearly 36,000 parameters compared to traditional deep learning approaches having millions of parameters.

Results

TaskDatasetMetricValueModel
Medical Image SegmentationMedico automatic polyp segmentation challenge (dataset)DSC0.7364NanoNet-A
Medical Image SegmentationMedico automatic polyp segmentation challenge (dataset)FPS28.07NanoNet-A
Medical Image SegmentationMedico automatic polyp segmentation challenge (dataset)Precision0.731NanoNet-A
Medical Image SegmentationMedico automatic polyp segmentation challenge (dataset)Recall0.8566NanoNet-A
Medical Image SegmentationMedico automatic polyp segmentation challenge (dataset)mIoU0.6319NanoNet-A
Medical Image SegmentationKvasirCapsule-SEGDSC0.9493NanoNet
Medical Image SegmentationKvasirCapsule-SEGmIoU0.9059NanoNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17