TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Reviving Iterative Training with Mask Guidance for Interac...

Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Konstantin Sofiiuk, Ilia A. Petrov, Anton Konushin

2021-02-12Interactive SegmentationSegmentation
PaperPDFCodeCodeCodeCodeCode

Abstract

Recent works on click-based interactive segmentation have demonstrated state-of-the-art results by using various inference-time optimization schemes. These methods are considerably more computationally expensive compared to feedforward approaches, as they require performing backward passes through a network during inference and are hard to deploy on mobile frameworks that usually support only forward passes. In this paper, we extensively evaluate various design choices for interactive segmentation and discover that new state-of-the-art results can be obtained without any additional optimization schemes. Thus, we propose a simple feedforward model for click-based interactive segmentation that employs the segmentation masks from previous steps. It allows not only to segment an entirely new object, but also to start with an external mask and correct it. When analyzing the performance of models trained on different datasets, we observe that the choice of a training dataset greatly impacts the quality of interactive segmentation. We find that the models trained on a combination of COCO and LVIS with diverse and high-quality annotations show performance superior to all existing models. The code and trained models are available at https://github.com/saic-vul/ritm_interactive_segmentation.

Results

TaskDatasetMetricValueModel
Interactive SegmentationGrabCutNoC@851.42RITM (HRNet18, C+L)
Interactive SegmentationGrabCutNoC@901.54RITM (HRNet18, C+L)
Interactive SegmentationGrabCutNoC@851.76RITM (HRNet18, SBD)
Interactive SegmentationGrabCutNoC@902.04RITM (HRNet18, SBD)
Interactive SegmentationBerkeleyNoC@902.26RITM (HRNet18, C+L)
Interactive SegmentationBerkeleyNoC@903.22RITM (HRNet18, SBD)
Interactive SegmentationDAVISNoC@854.11RITM (HRNet-32, C+L)
Interactive SegmentationDAVISNoC@905.34RITM (HRNet-32, C+L)
Interactive SegmentationDAVISNoC@854.36RITM (HRNet18, SBD)
Interactive SegmentationDAVISNoC@905.74RITM (HRNet18, SBD)
Interactive SegmentationSBDNoC@853.39RITM (HRNet18, SBD)
Interactive SegmentationSBDNoC@905.43RITM (HRNet18, SBD)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17