Metric: APho (higher is better)
| # | Model↕ | APho▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | DVIS++(VIT-L, Online) | 27.1 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 2 | STMask(R101-DCN-FPN) | 23.7 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |
| 3 | MDQE(SwinL) | 21.6 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 4 | BoxVIS(Swin-L & Box-sup) | 20.9 | No | BoxVIS: Video Instance Segmentation with Box Ann... | 2023-03-26 | Code |
| 5 | CTVIS (Swin-L) | 19.1 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 6 | CTVIS (ResNet-50) | 16.1 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 7 | CMaskTrack R-CNN (ResNet-50) | 4.1 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |
| 8 | CSipMask (ResNet-50) | 2.7 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |