Metric: VPQ (higher is better)
| # | Model | VPQ | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | ViP-DeepLab | 63.1 | Yes | ViP-DeepLab: Learning Visual Perception with Dep... | 2020-12-09 | Code |
| 2 | PolyphonicFormer | 62.3 | Yes | PolyphonicFormer: Unified Query Learning for Dep... | 2021-12-05 | Code |
| 3 | Video K-Net (Swin-B) | 62.2 | Yes | Video K-Net: A Simple, Strong, and Unified Basel... | 2022-04-10 | Code |
| 4 | TarViS (Swin-L) | 58.9 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 5 | TarViS (Swin-T) | 58.0 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 6 | VPSNet-SiamTrack | 57.3 | Yes | Learning to Associate Every Segment for Video Pa... | 2021-06-17 | - |
| 7 | VPSNet | 57.0 | Yes | Video Panoptic Segmentation | 2020-06-19 | Code |
| 8 | TarViS (ResNet-50) | 53.3 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |