TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SegGPT: Segmenting Everything In Context

SegGPT: Segmenting Everything In Context

Xinlong Wang, Xiaosong Zhang, Yue Cao, Wen Wang, Chunhua Shen, Tiejun Huang

2023-04-06Personalized SegmentationPanoptic SegmentationSegmentationFew-Shot Semantic SegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCodeCode(official)Code

Abstract

We present SegGPT, a generalist model for segmenting everything in context. We unify various segmentation tasks into a generalist in-context learning framework that accommodates different kinds of segmentation data by transforming them into the same format of images. The training of SegGPT is formulated as an in-context coloring problem with random color mapping for each data sample. The objective is to accomplish diverse tasks according to the context, rather than relying on specific colors. After training, SegGPT can perform arbitrary segmentation tasks in images or videos via in-context inference, such as object instance, stuff, part, contour, and text. SegGPT is evaluated on a broad range of tasks, including few-shot semantic segmentation, video object segmentation, semantic segmentation, and panoptic segmentation. Our results show strong capabilities in segmenting in-domain and out-of-domain targets, either qualitatively or quantitatively.

Results

TaskDatasetMetricValueModel
Few-Shot LearningFSS-1000 (5-shot)Mean IoU89.3SegGPT (ViT)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU67.9SegGPT (ViT)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU85.6SegGPT (ViT)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU83.2SegGPT (ViT)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU56.1SegGPT (ViT)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU89.8SegGPT (ViT)
Few-Shot Semantic SegmentationFSS-1000 (5-shot)Mean IoU89.3SegGPT (ViT)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU67.9SegGPT (ViT)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU85.6SegGPT (ViT)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU83.2SegGPT (ViT)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU56.1SegGPT (ViT)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU89.8SegGPT (ViT)
Meta-LearningFSS-1000 (5-shot)Mean IoU89.3SegGPT (ViT)
Meta-LearningCOCO-20i (5-shot)Mean IoU67.9SegGPT (ViT)
Meta-LearningFSS-1000 (1-shot)Mean IoU85.6SegGPT (ViT)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU83.2SegGPT (ViT)
Meta-LearningCOCO-20i (1-shot)Mean IoU56.1SegGPT (ViT)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU89.8SegGPT (ViT)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17