Video on MeViS
Metric: J (higher is better)
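The page does not define J. In the video object segmentation literature, J usually denotes region similarity: the Jaccard index (intersection over union) between predicted and ground-truth masks, averaged over frames. The sketch below illustrates that common reading in plain Python. It is not the benchmark's official evaluation code. The function names `jaccard` and `mean_j`, the empty-mask convention, and the 0-100 scaling are assumptions made for illustration.

```python
def jaccard(pred, gt):
    """IoU between two binary masks given as equal-shaped nested lists of 0/1."""
    inter = 0
    union = 0
    for row_p, row_g in zip(pred, gt):
        for p, g in zip(row_p, row_g):
            inter += 1 if (p and g) else 0
            union += 1 if (p or g) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


def mean_j(pred_frames, gt_frames):
    """Average per-frame IoU over one video, scaled to 0-100 (assumed scale)."""
    if not pred_frames or len(pred_frames) != len(gt_frames):
        raise ValueError("need the same non-zero number of frames in both lists")
    scores = [jaccard(p, g) for p, g in zip(pred_frames, gt_frames)]
    return 100.0 * sum(scores) / len(scores)
```

Under these assumptions, a video whose first frame matches exactly and whose second frame has no overlap scores 50.0. A full evaluation would typically also average over objects and expressions, which this sketch leaves out.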
Results
| # | Model | J | Extra Data | Paper | Date | Code |
|---|-------|---|------------|-------|------|------|
| 1 | MPG-SAM 2 | 50.7 | No | MPG-SAM 2: Adapting SAM 2 with Mask Priors and G... | 2025-01-23 | Code |
| 2 | FindTrack | 50.5 | No | Find First, Track Next: Decoupling Identificatio... | 2025-03-05 | Code |
| 3 | GLUS | 48.5 | No | GLUS: Global-Local Reasoning Unified into A Sing... | 2025-04-10 | Code |
| 4 | VRS-HQ (Chat-UniVi-13B) | 48 | No | The Devil is in Temporal Token: High Quality Vid... | 2025-01-15 | Code |
| 5 | SAMWISE | 45.4 | No | SAMWISE: Infusing Wisdom in SAM2 for Text-Driven... | 2024-11-26 | Code |
| 6 | ReferDINO (Swin-B) | 44.7 | No | ReferDINO: Referring Video Object Segmentation w... | 2025-01-24 | - |
| 7 | DsHmp + MTCM | 44.1 | No | Multi-Context Temporal Consistent Modeling for R... | 2025-01-09 | Code |
| 8 | DsHmp | 43 | No | Decoupling Static and Hierarchical Motion Percep... | 2024-04-04 | Code |
| 9 | HTR | 39.9 | No | Temporally Consistent Referring Video Object Seg... | 2024-03-28 | Code |
| 10 | LMPM | 34.2 | No | MeViS: A Large-scale Benchmark for Video Segment... | 2023-08-16 | Code |
| 11 | VLT+TC | 33.6 | No | VLT: Vision-Language Transformer and Query Gener... | 2022-10-28 | Code |
| 12 | ReferFormer | 29.8 | No | Language as Queries for Referring Video Object S... | 2022-01-03 | Code |
| 13 | MTTR | 28.8 | No | End-to-End Referring Video Object Segmentation w... | 2021-11-29 | Code |
| 14 | LBDT | 27.8 | No | Language-Bridged Spatial-Temporal Interaction fo... | 2022-06-08 | Code |
| 15 | URVOS | 25.7 | No | - | - | Code |