TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/MeViS

Video on MeViS

Metric: F (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕F▼Extra DataPaperDate↕Code
1MPG-SAM 256.7NoMPG-SAM 2: Adapting SAM 2 with Mask Priors and G...2025-01-23Code
2FindTrack55.9NoFind First, Track Next: Decoupling Identificatio...2025-03-05Code
3GLUS54.2NoGLUS: Global-Local Reasoning Unified into A Sing...2025-04-10Code
4ReferDINO (Swin-B)53.9NoReferDINO: Referring Video Object Segmentation w...2025-01-24-
5VRS-HQ (Chat-UniVi-13B)53.7NoThe Devil is in Temporal Token: High Quality Vid...2025-01-15Code
6SAMWISE51.2NoSAMWISE: Infusing Wisdom in SAM2 for Text-Driven...2024-11-26Code
7DsHmp + MTCM51.1NoMulti-Context Temporal Consistent Modeling for R...2025-01-09Code
8DsHmp49.8NoDecoupling Static and Hierarchical Motion Percep...2024-04-04Code
9HTR45.5NoTemporally Consistent Referring Video Object Seg...2024-03-28Code
10LMPM40.2NoMeViS: A Large-scale Benchmark for Video Segment...2023-08-16Code
11VLT+TC37.3NoVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
12ReferFormer32.2NoLanguage as Queries for Referring Video Object S...2022-01-03Code
13MTTR31.2NoEnd-to-End Referring Video Object Segmentation w...2021-11-29Code
14LBDT30.8NoLanguage-Bridged Spatial-Temporal Interaction fo...2022-06-08Code
15URVOS29.9No--Code