TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/MeViS

Video on MeViS

Metric: J (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕J▼Extra DataPaperDate↕Code
1MPG-SAM 250.7NoMPG-SAM 2: Adapting SAM 2 with Mask Priors and G...2025-01-23Code
2FindTrack50.5NoFind First, Track Next: Decoupling Identificatio...2025-03-05Code
3GLUS48.5NoGLUS: Global-Local Reasoning Unified into A Sing...2025-04-10Code
4VRS-HQ (Chat-UniVi-13B)48NoThe Devil is in Temporal Token: High Quality Vid...2025-01-15Code
5SAMWISE45.4NoSAMWISE: Infusing Wisdom in SAM2 for Text-Driven...2024-11-26Code
6ReferDINO (Swin-B)44.7NoReferDINO: Referring Video Object Segmentation w...2025-01-24-
7DsHmp + MTCM44.1NoMulti-Context Temporal Consistent Modeling for R...2025-01-09Code
8DsHmp43NoDecoupling Static and Hierarchical Motion Percep...2024-04-04Code
9HTR39.9NoTemporally Consistent Referring Video Object Seg...2024-03-28Code
10LMPM34.2NoMeViS: A Large-scale Benchmark for Video Segment...2023-08-16Code
11VLT+TC33.6NoVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
12ReferFormer29.8NoLanguage as Queries for Referring Video Object S...2022-01-03Code
13MTTR28.8NoEnd-to-End Referring Video Object Segmentation w...2021-11-29Code
14LBDT27.8NoLanguage-Bridged Spatial-Temporal Interaction fo...2022-06-08Code
15URVOS25.7No--Code