Metric: J (higher is better)
| # | Model↕ | J▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | FindTrack | 69.9 | No | Find First, Track Next: Decoupling Identificatio... | 2025-03-05 | Code |
| 2 | VD-IT | 66.2 | No | Exploring Pre-trained Text-to-Video Diffusion Mo... | 2024-03-18 | Code |
| 3 | MUTR | 64.8 | No | Referred by Multi-Modality: A Unified Temporal T... | 2023-05-25 | Code |
| 4 | SOC | 62.5 | No | SOC: Semantic-Assisted Object Cluster for Referr... | 2023-05-26 | Code |
| 5 | LoSh | 61.8 | No | LoSh: Long-Short Text Joint Prediction Network f... | 2023-06-14 | Code |
| 6 | DsHmp | 61.7 | No | Decoupling Static and Hierarchical Motion Percep... | 2024-04-04 | Code |
| 7 | SgMg | 60.6 | No | Spectrum-guided Multi-granularity Referring Vide... | 2023-07-25 | Code |
| 8 | HTML | 59.2 | No | - | - | - |
| 9 | ReferFormer | 58.1 | No | Language as Queries for Referring Video Object S... | 2022-01-03 | Code |
| 10 | URVOS | 47.3 | No | - | - | Code |