TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Object Segmentation/Refer-YouTube-VOS

Video Object Segmentation on Refer-YouTube-VOS

Metric: J (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕J▼Extra DataPaperDate↕Code
1FindTrack71.8YesFind First, Track Next: Decoupling Identificatio...2025-03-05Code
2GLEE-Pro68.2YesGeneral Object Foundation Model for Images and V...2023-12-14Code
3GLEE-Plus65.6YesGeneral Object Foundation Model for Images and V...2023-12-14Code
4HTR65.3YesTemporally Consistent Referring Video Object Seg...2024-03-28Code
5SOC64.1YesSOC: Semantic-Assisted Object Cluster for Referr...2023-05-26Code
6SgMg63.9YesSpectrum-guided Multi-granularity Referring Vide...2023-07-25Code
7VATEX63.3NoVision-Aware Text Features in Referring Image Se...2024-04-12Code
8VLT61.9YesVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
9HTML-SwinL61.5Yes---
10HTML-Video-SwinB61.5Yes---
11ReferFormer (Large)61.3YesLanguage as Queries for Referring Video Object S...2022-01-03Code
12HTML-Video-SwinS59.9Yes---
13HTML-Video-SwinT59.5Yes---
14R2VOS (Swin-T)58.9NoTowards Robust Referring Video Object Segmentati...2022-07-04Code
15HTML-ResNet10157.3Yes---
16HTML-ResNet5056.5Yes---
17CMSA34.8YesCross-Modal Self-Attention Network for Referring...2019-04-09Code