TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video/Refer-YouTube-VOS

Video on Refer-YouTube-VOS

Metric: F (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕F▼Extra DataPaperDate↕Code
1FindTrack75.7YesFind First, Track Next: Decoupling Identificatio...2025-03-05Code
2GLEE-Pro72.9YesGeneral Object Foundation Model for Images and V...2023-12-14Code
3GLEE-Plus69.7YesGeneral Object Foundation Model for Images and V...2023-12-14Code
4HTR68.9YesTemporally Consistent Referring Video Object Seg...2024-03-28Code
5SOC67.9YesSOC: Semantic-Assisted Object Cluster for Referr...2023-05-26Code
6VATEX67.5NoVision-Aware Text Features in Referring Image Se...2024-04-12Code
7SgMg67.4YesSpectrum-guided Multi-granularity Referring Vide...2023-07-25Code
8VLT65.6YesVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
9HTML-SwinL65.3Yes---
10HTML-Video-SwinB65.2Yes---
11ReferFormer (Large)64.6YesLanguage as Queries for Referring Video Object S...2022-01-03Code
12HTML-Video-SwinT63Yes---
13HTML-Video-SwinS62.9Yes---
14R2VOS (Swin-T)61.5NoTowards Robust Referring Video Object Segmentati...2022-07-04Code
15HTML-ResNet10159.8Yes---
16HTML-ResNet5059Yes---
17CMSA38.1YesCross-Modal Self-Attention Network for Referring...2019-04-09Code