Metric: J&F (higher is better)
| # | Model↕ | J&F▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | FindTrack | 74.2 | No | Find First, Track Next: Decoupling Identificatio... | 2025-03-05 | Code |
| 2 | VD-IT | 69.4 | No | Exploring Pre-trained Text-to-Video Diffusion Mo... | 2024-03-18 | Code |
| 3 | MUTR | 68 | No | Referred by Multi-Modality: A Unified Temporal T... | 2023-05-25 | Code |
| 4 | SOC | 65.8 | No | SOC: Semantic-Assisted Object Cluster for Referr... | 2023-05-26 | Code |
| 5 | DsHmp | 64.9 | No | Decoupling Static and Hierarchical Motion Percep... | 2024-04-04 | Code |
| 6 | LoSh | 64.3 | No | LoSh: Long-Short Text Joint Prediction Network f... | 2023-06-14 | Code |
| 7 | SgMg | 63.3 | No | Spectrum-guided Multi-granularity Referring Vide... | 2023-07-25 | Code |
| 8 | HTML | 62.1 | No | - | - | - |
| 9 | ReferFormer | 61.1 | No | Language as Queries for Referring Video Object S... | 2022-01-03 | Code |
| 10 | LBDT | 54.5 | No | Language-Bridged Spatial-Temporal Interaction fo... | 2022-06-08 | Code |
| 11 | URVOS | 51.6 | No | - | - | Code |