TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MST: Adaptive Multi-Scale Tokens Guided Interactive Segmen...

MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation

Long Xu, Shanghong Li, Yongquan Chen, Jun Luo, Shiwu Lai

2024-01-09BenchmarkingInteractive SegmentationSegmentation
PaperPDFCode(official)

Abstract

Interactive segmentation has gained significant attention for its application in human-computer interaction and data annotation. To address the target scale variation issue in interactive segmentation, a novel multi-scale token adaptation algorithm is proposed. By performing top-k operations across multi-scale tokens, the computational complexity is greatly simplified while ensuring performance. To enhance the robustness of multi-scale token selection, we also propose a token learning algorithm based on contrastive loss. This algorithm can effectively improve the performance of multi-scale token adaptation. Extensive benchmarking shows that the algorithm achieves state-of-the-art (SOTA) performance, compared to current methods. An interactive demo and all reproducible codes will be released at https://github.com/hahamyt/mst.

Results

TaskDatasetMetricValueModel
Interactive SegmentationGrabCutNoC@901.48ViT-B+MST+CL
Interactive SegmentationBerkeleyNoC@901.5ViT-B+MST+CL
Interactive SegmentationCOCO minivalNoC@852.08ViT-B+MST+CL
Interactive SegmentationCOCO minivalNoC@902.85ViT-B+MST+CL
Interactive SegmentationDAVIS-585NoC@851.8ViT-B+MST+CL
Interactive SegmentationDAVIS-585NoC@902.29ViT-B+MST+CL
Interactive SegmentationPascalVOCNoC@851.69ViT-B+MST+CL
Interactive SegmentationPascalVOCNoC@901.9ViT-B+MST+CL
Interactive SegmentationDAVISNoC@904.55ViT-B+MST+CL
Interactive SegmentationSBDNoC@853.03ViT-B+MST+CL

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Visual Place Recognition for Large-Scale UAV Applications2025-07-20Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17