TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Object Segmentation/Refer-YouTube-VOS

Video Object Segmentation on Refer-YouTube-VOS

Metric: J&F (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕J&F▼Extra DataPaperDate↕Code
1FindTrack73.7YesFind First, Track Next: Decoupling Identificatio...2025-03-05Code
2GLEE-Pro70.6YesGeneral Object Foundation Model for Images and V...2023-12-14Code
3HyperSeg68.5YesHyperSeg: Towards Universal Visual Segmentation ...2024-11-26Code
4GLEE-Plus67.7YesGeneral Object Foundation Model for Images and V...2023-12-14Code
5HTR67.1YesTemporally Consistent Referring Video Object Seg...2024-03-28Code
6SOC66YesSOC: Semantic-Assisted Object Cluster for Referr...2023-05-26Code
7SgMg65.7YesSpectrum-guided Multi-granularity Referring Vide...2023-07-25Code
8VATEX65.4NoVision-Aware Text Features in Referring Image Se...2024-04-12Code
9VLT63.8YesVLT: Vision-Language Transformer and Query Gener...2022-10-28Code
10HTML-SwinL63.4Yes---
11HTML-Video-SwinB63.4Yes---
12ReferFormer (Large)62.9YesLanguage as Queries for Referring Video Object S...2022-01-03Code
13HTML-Video-SwinS61.4Yes---
14HTML-Video-SwinT61.2Yes---
15R2VOS (Swin-T)60.2NoTowards Robust Referring Video Object Segmentati...2022-07-04Code
16HTML-ResNet10158.5Yes---
17HTML-ResNet5057.8Yes---
18CMSA36.4YesCross-Modal Self-Attention Network for Referring...2019-04-09Code