Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

GRES: Generalized Referring Expression Segmentation

Chang Liu, Henghui Ding, Xudong Jiang

Published: 2023-06-01
Venue: CVPR 2023
Tasks: Referring Expression, Generalized Referring Expression Segmentation, Referring Expression Segmentation, Segmentation

Abstract

Referring Expression Segmentation (RES) aims to generate a segmentation mask for the object described by a given language expression. Existing classic RES datasets and methods commonly support single-target expressions only, i.e., one expression refers to one target object. Multi-target and no-target expressions are not considered. This limits the usage of RES in practice. In this paper, we introduce a new benchmark called Generalized Referring Expression Segmentation (GRES), which extends the classic RES to allow expressions to refer to an arbitrary number of target objects. Towards this, we construct the first large-scale GRES dataset called gRefCOCO that contains multi-target, no-target, and single-target expressions. GRES and gRefCOCO are designed to be well-compatible with RES, facilitating extensive experiments to study the performance gap of the existing RES methods on the GRES task. In the experimental study, we find that one of the big challenges of GRES is complex relationship modeling. Based on this, we propose a region-based GRES baseline ReLA that adaptively divides the image into regions with sub-instance clues, and explicitly models the region-region and region-language dependencies. The proposed approach ReLA achieves new state-of-the-art performance on both the newly proposed GRES task and the classic RES task. The proposed gRefCOCO dataset and method are available at https://henghuiding.github.io/GRES.

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Instance Segmentation | RefCOCO val | Overall IoU | 73.82 | ReLA |
| Instance Segmentation | RefCOCO+ val | Overall IoU | 66.04 | ReLA |
| Instance Segmentation | RefCOCO+ testB | Overall IoU | 57.65 | ReLA |
| Instance Segmentation | RefCOCO+ testA | Overall IoU | 71.02 | ReLA |
| Instance Segmentation | gRefCOCO | cIoU | 62.42 | ReLA |
| Instance Segmentation | gRefCOCO | gIoU | 63.6 | ReLA |
| Referring Expression Segmentation | RefCOCO val | Overall IoU | 73.82 | ReLA |
| Referring Expression Segmentation | RefCOCO+ val | Overall IoU | 66.04 | ReLA |
| Referring Expression Segmentation | RefCOCO+ testB | Overall IoU | 57.65 | ReLA |
| Referring Expression Segmentation | RefCOCO+ testA | Overall IoU | 71.02 | ReLA |
| Referring Expression Segmentation | gRefCOCO | cIoU | 62.42 | ReLA |
| Referring Expression Segmentation | gRefCOCO | gIoU | 63.6 | ReLA |
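
The gRefCOCO rows report two metrics, cIoU and gIoU, which this page does not define. The sketch below shows how they are commonly described for GRES evaluation, as an assumption rather than the official evaluation code: cIoU divides the total intersection pixels by the total union pixels over all samples, while gIoU averages the per-sample IoU and scores a correctly predicted no-target sample (empty prediction, empty ground truth) as 1. The function names and the nested-list mask representation are hypothetical, chosen only for illustration.

```python
from typing import Sequence, Tuple

# A binary mask as rows of 0/1 values (illustrative representation, an assumption).
Mask = Sequence[Sequence[int]]


def intersection_and_union(pred: Mask, target: Mask) -> Tuple[int, int]:
    """Count the pixels in the intersection and union of two binary masks."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    return inter, union


def cumulative_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """cIoU: total intersection pixels over total union pixels across all samples."""
    total_inter = 0
    total_union = 0
    for pred, target in zip(preds, targets):
        inter, union = intersection_and_union(pred, target)
        total_inter += inter
        total_union += union
    return total_inter / total_union if total_union else 0.0


def generalized_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """gIoU: mean per-sample IoU; an empty prediction on an empty target counts as 1."""
    scores = []
    for pred, target in zip(preds, targets):
        inter, union = intersection_and_union(pred, target)
        if union == 0:
            # Assumed convention: a correctly predicted no-target sample scores 1.
            scores.append(1.0)
        else:
            # A non-empty prediction on a no-target sample gives inter == 0, so 0.
            scores.append(inter / union)
    return sum(scores) / len(scores) if scores else 0.0


# Two toy samples: one single-target, one no-target predicted correctly.
preds = [[[1, 1], [0, 0]], [[0, 0], [0, 0]]]
targets = [[[1, 0], [0, 0]], [[0, 0], [0, 0]]]
print(cumulative_iou(preds, targets), generalized_iou(preds, targets))
```

In this toy case the no-target sample raises gIoU but contributes no pixels to cIoU, which is one reason the two numbers can differ on a dataset that contains no-target expressions.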

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)