Metric: F (higher is better)
| # | Model | F | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | FindTrack | 78.5 | No | Find First, Track Next: Decoupling Identificatio... | 2025-03-05 | Code |
| 2 | VD-IT | 72.6 | No | Exploring Pre-trained Text-to-Video Diffusion Mo... | 2024-03-18 | Code |
| 3 | MUTR | 71.3 | No | Referred by Multi-Modality: A Unified Temporal T... | 2023-05-25 | Code |
| 4 | SOC | 69.1 | No | SOC: Semantic-Assisted Object Cluster for Referr... | 2023-05-26 | Code |
| 5 | DsHmp | 68.1 | No | Decoupling Static and Hierarchical Motion Percep... | 2024-04-04 | Code |
| 6 | LoSh | 66.8 | No | LoSh: Long-Short Text Joint Prediction Network f... | 2023-06-14 | Code |
| 7 | SgMg | 66.0 | No | Spectrum-guided Multi-granularity Referring Vide... | 2023-07-25 | Code |
| 8 | HTML | 65.1 | No | - | - | - |
| 9 | ReferFormer | 64.1 | No | Language as Queries for Referring Video Object S... | 2022-01-03 | Code |
| 10 | URVOS | 56.0 | No | - | - | Code |