Metric: SQ (higher is better)
| # | Model↕ | SQ▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Video K-Net (Swin-L) | 75 | Yes | Video K-Net: A Simple, Strong, and Unified Basel... | 2022-04-10 | Code |
| 2 | Tube-Link(Swin-base) | 74 | Yes | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 3 | TarViS (Swin-L) | 72 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 4 | TarViS (Swin-T) | 69.9 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 5 | TarViS (ResNet-50) | 68.8 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 6 | Unified Perception | 61.9 | No | Unified Perception: Efficient Depth-Aware Video ... | 2023-03-03 | - |