TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Referring Expression Segmentation/RefCOCOg-val

Referring Expression Segmentation on RefCOCOg-val

Metric: Overall IoU (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Overall IoU▼Extra DataPaperDate↕Code
1MLCD-Seg-7B79.9YesMulti-label Cluster Discrimination for Visual Re...2024-07-24Code
2HyperSeg79.4YesHyperSeg: Towards Universal Visual Segmentation ...2024-11-26Code
3UniLSeg-10079.27YesUniversal Segmentation at Arbitrary Granularity ...2023-12-04Code
4UniLSeg-2078.41YesUniversal Segmentation at Arbitrary Granularity ...2023-12-04Code
5EVF-SAM78.2YesEVF-SAM: Early Vision-Language Fusion for Text-P...2024-06-28Code
6SegAgent75.11NoSegAgent: Exploring Pixel Understanding Capabili...2025-03-11Code
7DETRIS74.6NoDensely Connected Parameter-Efficient Tuning for...2025-01-15Code
8C3VG74.43NoMulti-task Visual Grounding with Coarse-to-Fine ...2025-01-12Code
9GROUNDHOG74.1YesGROUNDHOG: Grounding Large Language Models to Ho...2024-02-26-
10GLEE-Pro72.9YesGeneral Object Foundation Model for Images and V...2023-12-14Code
11SafaRi-B70.48YesSafaRi:Adaptive Sequence Transformer for Weakly ...2024-07-02-
12PolyFormer-L69.2YesPolyFormer: Referring Image Segmentation as Sequ...2023-02-14Code
13MaskRIS (Swin-B, combined DB)69.12NoMaskRIS: Semantic Distortion-aware Data Augmenta...2024-11-28Code
14PolyFormer-B67.76YesPolyFormer: Referring Image Segmentation as Sequ...2023-02-14Code
15MaskRIS (Swin-B)65.55NoMaskRIS: Semantic Distortion-aware Data Augmenta...2024-11-28Code
16MagNet65.36NoMask Grounding for Referring Image Segmentation2023-12-19Code
17X-Decoder (Davit-d5)64.6YesGeneralized Decoding for Pixel, Image, and Langu...2022-12-21Code
18VLT (Swin-B)63.49NoVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
19LAVT61.24NoLAVT: Language-Aware Vision Transformer for Refe...2021-12-04Code
20VLT (Darknet53)52.99NoVision-Language Transformer and Query Generation...2021-08-12Code
21SHNet49.9NoComprehensive Multi-Modal Interactions for Refer...2021-04-21Code