TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TGANet: Text-guided attention for improved polyp segmentat...

TGANet: Text-guided attention for improved polyp segmentation

Nikhil Kumar Tomar, Debesh Jha, Ulas Bagci, Sharib Ali

2022-05-09AttributePolyp SegmentationMedical Image Segmentation
PaperPDFCode(official)

Abstract

Colonoscopy is a gold standard procedure but is highly operator-dependent. Automated polyp segmentation, a precancerous precursor, can minimize missed rates and timely treatment of colon cancer at an early stage. Even though there are deep learning methods developed for this task, variability in polyp size can impact model training, thereby limiting it to the size attribute of the majority of samples in the training dataset that may provide sub-optimal results to differently sized polyps. In this work, we exploit size-related and polyp number-related features in the form of text attention during training. We introduce an auxiliary classification task to weight the text-based embedding that allows network to learn additional feature representations that can distinctly adapt to differently sized polyps and can adapt to cases with multiple polyps. Our experimental results demonstrate that these added text embeddings improve the overall performance of the model compared to state-of-the-art segmentation methods. We explore four different datasets and provide insights for size-specific improvements. Our proposed text-guided attention network (TGANet) can generalize well to variable-sized polyps in different datasets.

Results

TaskDatasetMetricValueModel
Medical Image SegmentationKvasir-SEGmIoU0.833TGA-Net
Medical Image SegmentationKvasir-SEGmean Dice0.8982TGA-Net
Medical Image SegmentationBKAI-IGH NeoPolyp-SmallAverage Dice0.9023TGANet
Medical Image SegmentationBKAI-IGH NeoPolyp-SmallmIoU0.8409TGANet
Semantic SegmentationKvasir-SEGmDice0.8982TGA-Net
Semantic SegmentationKvasir-SEGmIoU0.833TGA-Net
10-shot image generationKvasir-SEGmDice0.8982TGA-Net
10-shot image generationKvasir-SEGmIoU0.833TGA-Net

Related Papers

DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16Non-Adaptive Adversarial Face Generation2025-07-16Attributes Shape the Embedding Space of Face Recognition Models2025-07-15COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models2025-07-13