Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Referring Expression Segmentation
/
A2D Sentences
Referring Expression Segmentation on A2D Sentences
Metric: IoU overall (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
IoU overall (best first)
IoU overall (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
IoU overall
▼
Extra Data
Paper
Date
↕
Code
1
SOC (Video-Swin-B)
0.807
Yes
SOC: Semantic-Assisted Object Cluster for Referr...
2023-05-26
Code
2
SgMg (Video-Swin-B)
0.799
Yes
Spectrum-guided Multi-granularity Referring Vide...
2023-07-25
Code
3
ReferFormer (Video-Swin-B)
0.786
Yes
Language as Queries for Referring Video Object S...
2022-01-03
Code
4
SOC (Video-Swin-T)
0.747
No
SOC: Semantic-Assisted Object Cluster for Referr...
2023-05-26
Code
5
MANET
0.726
No
Multi-Attention Network for Compressed Video Ref...
2022-07-26
Code
6
MTTR (w=10)
0.72
No
End-to-End Referring Video Object Segmentation w...
2021-11-29
Code
7
VLIDE
0.714
No
Deeply Interleaved Two-Stream Encoder for Referr...
2022-03-30
-
8
MTTR (w=8)
0.702
No
End-to-End Referring Video Object Segmentation w...
2021-11-29
Code
9
Locater
0.69
No
Local-Global Context Aware Transformer for Langu...
2022-03-18
Code
10
HINet
0.679
No
-
-
-
11
mmmmtbvs
0.673
No
Modeling Motion with Multi-Modal Features for Te...
2022-04-06
Code
12
RefVOS
0.672
No
-
-
-
13
Hui et al.
0.662
No
Collaborative Spatial-Temporal Modeling for Lang...
2021-05-14
-
14
PRPE
0.661
No
-
-
-
15
CMPC-V (I3D)
0.653
No
Cross-Modal Progressive Comprehension for Referr...
2021-05-15
Code
16
CMPC-V (R2D)
0.649
No
Cross-Modal Progressive Comprehension for Referr...
2021-05-15
Code
17
ClawCraneNet
0.644
No
ClawCraneNet: Leveraging Object-level Relation f...
2021-03-19
-
18
CMDy
0.623
No
-
-
-
19
CMSA+CFSA
0.618
No
Referring Segmentation in Images and Videos with...
2021-02-09
-
20
AAMN
0.617
No
Actor and Action Modular Network for Text-based ...
2020-11-02
-
21
ACGA
0.601
No
-
-
Code
22
RefVOS
0.599
No
RefVOS: A Closer Look at Referring Expressions f...
2020-10-01
Code
23
VT-Capsule
0.568
No
-
-
-
24
Gavriluyk el al. (Optical flow)
0.551
No
Actor and Action Video Segmentation from a Sente...
2018-03-20
Code
25
Gavriluyk el al.
0.536
No
Actor and Action Video Segmentation from a Sente...
2018-03-20
Code
26
Li et al.
0.515
No
-
-
-
27
Hu et al.
0.474
No
Segmentation from Natural Language Expressions
2016-03-20
Code
#1
SOC (Video-Swin-B)
SOTA
0.807
IoU overall
· Extra Data
· 2023-05-26
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Code
#2
SgMg (Video-Swin-B)
0.799
IoU overall
· Extra Data
· 2023-07-25
Spectrum-guided Multi-granularity Referring Video Object Segmentation
Code
#3
ReferFormer (Video-Swin-B)
SOTA
0.786
IoU overall
· Extra Data
· 2022-01-03
Language as Queries for Referring Video Object Segmentation
Code
#4
SOC (Video-Swin-T)
0.747
IoU overall
· 2023-05-26
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation
Code
#5
MANET
0.726
IoU overall
· 2022-07-26
Multi-Attention Network for Compressed Video Referring Object Segmentation
Code
#6
MTTR (w=10)
SOTA
0.72
IoU overall
· 2021-11-29
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Code
#7
VLIDE
0.714
IoU overall
· 2022-03-30
Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation
#8
MTTR (w=8)
0.702
IoU overall
· 2021-11-29
End-to-End Referring Video Object Segmentation with Multimodal Transformers
Code
#9
Locater
0.69
IoU overall
· 2022-03-18
Local-Global Context Aware Transformer for Language-Guided Video Segmentation
Code
#10
HINet
0.679
IoU overall
No paper
#11
mmmmtbvs
0.673
IoU overall
· 2022-04-06
Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
Code
#12
RefVOS
0.672
IoU overall
No paper
#13
Hui et al.
SOTA
0.662
IoU overall
· 2021-05-14
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation
#14
PRPE
0.661
IoU overall
No paper
#15
CMPC-V (I3D)
0.653
IoU overall
· 2021-05-15
Cross-Modal Progressive Comprehension for Referring Segmentation
Code
#16
CMPC-V (R2D)
0.649
IoU overall
· 2021-05-15
Cross-Modal Progressive Comprehension for Referring Segmentation
Code
#17
ClawCraneNet
SOTA
0.644
IoU overall
· 2021-03-19
ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation
#18
CMDy
0.623
IoU overall
No paper
#19
CMSA+CFSA
SOTA
0.618
IoU overall
· 2021-02-09
Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network
#20
AAMN
SOTA
0.617
IoU overall
· 2020-11-02
Actor and Action Modular Network for Text-based Video Segmentation
#21
ACGA
0.601
IoU overall
No paper
Code
#22
RefVOS
SOTA
0.599
IoU overall
· 2020-10-01
RefVOS: A Closer Look at Referring Expressions for Video Object Segmentation
Code
#23
VT-Capsule
0.568
IoU overall
No paper
#24
Gavriluyk el al. (Optical flow)
SOTA
0.551
IoU overall
· 2018-03-20
Actor and Action Video Segmentation from a Sentence
Code
#25
Gavriluyk el al.
0.536
IoU overall
· 2018-03-20
Actor and Action Video Segmentation from a Sentence
Code
#26
Li et al.
0.515
IoU overall
No paper
#27
Hu et al.
SOTA
0.474
IoU overall
· 2016-03-20
Segmentation from Natural Language Expressions
Code