Metric: AQ (higher is better)
| # | Model↕ | AQ▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Video K-Net (Swin-L) | 73 | Yes | Video K-Net: A Simple, Strong, and Unified Basel... | 2022-04-10 | Code |
| 2 | TarViS (Swin-L) | 72 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 3 | TarViS (Swin-T) | 71.2 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 4 | TarViS (ResNet-50) | 70.3 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 5 | Tube-Link(Swin-base) | 69 | Yes | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 6 | Unified Perception | 56.4 | No | Unified Perception: Efficient Depth-Aware Video ... | 2023-03-03 | - |