Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Video
/
MeViS
Video on MeViS
Metric: F (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
#
Model
↕
F
▼
Extra Data
Paper
Date
↕
Code
1
MPG-SAM 2
56.7
No
MPG-SAM 2: Adapting SAM 2 with Mask Priors and G...
2025-01-23
Code
2
FindTrack
55.9
No
Find First, Track Next: Decoupling Identificatio...
2025-03-05
Code
3
GLUS
54.2
No
GLUS: Global-Local Reasoning Unified into A Sing...
2025-04-10
Code
4
ReferDINO (Swin-B)
53.9
No
ReferDINO: Referring Video Object Segmentation w...
2025-01-24
-
5
VRS-HQ (Chat-UniVi-13B)
53.7
No
The Devil is in Temporal Token: High Quality Vid...
2025-01-15
Code
6
SAMWISE
51.2
No
SAMWISE: Infusing Wisdom in SAM2 for Text-Driven...
2024-11-26
Code
7
DsHmp + MTCM
51.1
No
Multi-Context Temporal Consistent Modeling for R...
2025-01-09
Code
8
DsHmp
49.8
No
Decoupling Static and Hierarchical Motion Percep...
2024-04-04
Code
9
HTR
45.5
No
Temporally Consistent Referring Video Object Seg...
2024-03-28
Code
10
LMPM
40.2
No
MeViS: A Large-scale Benchmark for Video Segment...
2023-08-16
Code
11
VLT+TC
37.3
No
VLT: Vision-Language Transformer and Query Gener...
2022-10-28
Code
12
ReferFormer
32.2
No
Language as Queries for Referring Video Object S...
2022-01-03
Code
13
MTTR
31.2
No
End-to-End Referring Video Object Segmentation w...
2021-11-29
Code
14
LBDT
30.8
No
Language-Bridged Spatial-Temporal Interaction fo...
2022-06-08
Code
15
URVOS
29.9
No
-
-
Code