Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Referring Expression Segmentation
/
RefCOCOg-test
Referring Expression Segmentation on RefCOCOg-test
Metric: Overall IoU (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
#
Model
↕
Overall IoU
▼
Extra Data
Paper
Date
↕
Code
1
UniLSeg-100
80.54
Yes
Universal Segmentation at Arbitrary Granularity ...
2023-12-04
Code
2
MLCD-Seg-7B
80.5
Yes
Multi-label Cluster Discrimination for Visual Re...
2024-07-24
Code
3
UniLSeg-20
79.47
Yes
Universal Segmentation at Arbitrary Granularity ...
2023-12-04
Code
4
HyperSeg
78.9
Yes
HyperSeg: Towards Universal Visual Segmentation ...
2024-11-26
Code
5
EVF-SAM
78.3
Yes
EVF-SAM: Early Vision-Language Fusion for Text-P...
2024-06-28
Code
6
C3VG
76.39
No
Multi-task Visual Grounding with Coarse-to-Fine ...
2025-01-12
Code
7
DETRIS
75.3
No
Densely Connected Parameter-Efficient Tuning for...
2025-01-15
Code
8
GROUNDHOG
74.6
Yes
GROUNDHOG: Grounding Large Language Models to Ho...
2024-02-26
-
9
MaskRIS (Swin-B, combined DB)
71.09
No
MaskRIS: Semantic Distortion-aware Data Augmenta...
2024-11-28
Code
10
SafaRi-B
71.06
Yes
SafaRi:Adaptive Sequence Transformer for Weakly ...
2024-07-02
-
11
PolyFormer-L
70.19
Yes
PolyFormer: Referring Image Segmentation as Sequ...
2023-02-14
Code
12
PolyFormer-B
69.05
Yes
PolyFormer: Referring Image Segmentation as Sequ...
2023-02-14
Code
13
MaskRIS (Swin-B)
66.5
No
MaskRIS: Semantic Distortion-aware Data Augmenta...
2024-11-28
Code
14
MagNet
66.03
No
Mask Grounding for Referring Image Segmentation
2023-12-19
Code
15
LAVT (Swin-B)
62.09
No
LAVT: Language-Aware Vision Transformer for Refe...
2021-12-04
Code
16
VLT (Darknet53)
56.65
No
Vision-Language Transformer and Query Generation...
2021-08-12
Code