Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Referring Expression Segmentation
/
RefCOCO testB
Referring Expression Segmentation on RefCOCO testB
Metric: Overall IoU (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
Overall IoU (best first)
Overall IoU (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
Overall IoU
▼
Extra Data
Paper
Date
↕
Code
1
HyperSeg
83.4
Yes
HyperSeg: Towards Universal Visual Segmentation ...
2024-11-26
Code
2
DeRIS-L
82.87
No
DeRIS: Decoupling Perception and Cognition for E...
2025-07-02
Code
3
MLCD-Seg-7B
81.5
Yes
Multi-label Cluster Discrimination for Visual Re...
2024-07-24
Code
4
EVF-SAM
80.2
Yes
EVF-SAM: Early Vision-Language Fusion for Text-P...
2024-06-28
Code
5
DETRIS
79
No
Densely Connected Parameter-Efficient Tuning for...
2025-01-15
Code
6
C3VG
77.86
No
Multi-task Visual Grounding with Coarse-to-Fine ...
2025-01-12
Code
7
MaskRIS (Swin-B, combined DB)
75.1
No
MaskRIS: Semantic Distortion-aware Data Augmenta...
2024-11-28
Code
8
MaskRIS (Swin-B)
73.96
No
MaskRIS: Semantic Distortion-aware Data Augmenta...
2024-11-28
Code
9
EVP
72.94
No
EVP: Enhanced Visual Perception using Inverse Mu...
2023-12-13
Code
10
MagNet
71.05
No
Mask Grounding for Referring Image Segmentation
2023-12-19
Code
11
SafaRi
70.71
No
SafaRi:Adaptive Sequence Transformer for Weakly ...
2024-07-02
-
12
SeqTR
64.12
No
SeqTR: A Simple yet Universal Network for Visual...
2022-03-30
Code
#1
HyperSeg
SOTA
83.4
Overall IoU
· Extra Data
· 2024-11-26
HyperSeg: Towards Universal Visual Segmentation with Large Language Model
Code
#2
DeRIS-L
82.87
Overall IoU
· 2025-07-02
DeRIS: Decoupling Perception and Cognition for Enhanced Referring Image Segmentation through Loopback Synergy
Code
#3
MLCD-Seg-7B
SOTA
81.5
Overall IoU
· Extra Data
· 2024-07-24
Multi-label Cluster Discrimination for Visual Representation Learning
Code
#4
EVF-SAM
SOTA
80.2
Overall IoU
· Extra Data
· 2024-06-28
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
Code
#5
DETRIS
79
Overall IoU
· 2025-01-15
Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
Code
#6
C3VG
77.86
Overall IoU
· 2025-01-12
Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
Code
#7
MaskRIS (Swin-B, combined DB)
75.1
Overall IoU
· 2024-11-28
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
Code
#8
MaskRIS (Swin-B)
73.96
Overall IoU
· 2024-11-28
MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation
Code
#9
EVP
SOTA
72.94
Overall IoU
· 2023-12-13
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment
Code
#10
MagNet
71.05
Overall IoU
· 2023-12-19
Mask Grounding for Referring Image Segmentation
Code
#11
SafaRi
70.71
Overall IoU
· 2024-07-02
SafaRi:Adaptive Sequence Transformer for Weakly Supervised Referring Expression Segmentation
#12
SeqTR
SOTA
64.12
Overall IoU
· 2022-03-30
SeqTR: A Simple yet Universal Network for Visual Grounding
Code