TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PhraseCut: Language-based Image Segmentation in the Wild

PhraseCut: Language-based Image Segmentation in the Wild

Chenyun Wu, Zhe Lin, Scott Cohen, Trung Bui, Subhransu Maji

2020-08-03CVPR 2020 6AttributeReferring Expression SegmentationSemantic SegmentationImage Segmentation
PaperPDFCode(official)

Abstract

We consider the problem of segmenting image regions given a natural language phrase, and study it on a novel dataset of 77,262 images and 345,486 phrase-region pairs. Our dataset is collected on top of the Visual Genome dataset and uses the existing annotations to generate a challenging set of referring phrases for which the corresponding regions are manually annotated. Phrases in our dataset correspond to multiple regions and describe a large number of object and stuff categories as well as their attributes such as color, shape, parts, and relationships with other entities in the image. Our experiments show that the scale and diversity of concepts in our dataset poses significant challenges to the existing state-of-the-art. We systematically handle the long-tail nature of these concepts and present a modular approach to combine category, attribute, and relationship cues that outperforms existing approaches.

Results

TaskDatasetMetricValueModel
Instance SegmentationPhraseCutMean IoU41.3HULANet
Instance SegmentationPhraseCutPr@0.542.9HULANet
Instance SegmentationPhraseCutPr@0.727.8HULANet
Instance SegmentationPhraseCutPr@0.95.9HULANet
Instance SegmentationPhraseCutMean IoU21.1RMI
Instance SegmentationPhraseCutPr@0.522RMI
Instance SegmentationPhraseCutPr@0.711.6RMI
Instance SegmentationPhraseCutPr@0.91.5RMI
Instance SegmentationPhraseCutMean IoU20.2MattNet
Instance SegmentationPhraseCutPr@0.519.7MattNet
Instance SegmentationPhraseCutPr@0.713.5MattNet
Instance SegmentationPhraseCutPr@0.93MattNet
Referring Expression SegmentationPhraseCutMean IoU41.3HULANet
Referring Expression SegmentationPhraseCutPr@0.542.9HULANet
Referring Expression SegmentationPhraseCutPr@0.727.8HULANet
Referring Expression SegmentationPhraseCutPr@0.95.9HULANet
Referring Expression SegmentationPhraseCutMean IoU21.1RMI
Referring Expression SegmentationPhraseCutPr@0.522RMI
Referring Expression SegmentationPhraseCutPr@0.711.6RMI
Referring Expression SegmentationPhraseCutPr@0.91.5RMI
Referring Expression SegmentationPhraseCutMean IoU20.2MattNet
Referring Expression SegmentationPhraseCutPr@0.519.7MattNet
Referring Expression SegmentationPhraseCutPr@0.713.5MattNet
Referring Expression SegmentationPhraseCutPr@0.93MattNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16Non-Adaptive Adversarial Face Generation2025-07-16SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16