TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Video Instance Segmentation/OVIS validation

Video Instance Segmentation on OVIS validation

Metric: AR1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕AR1▼Extra DataPaperDate↕Code
1CAVIS(VIT-L, Offline)21.2YesContext-Aware Video Instance Segmentation2024-07-03Code
2DVIS++(VIT-L, Online)20.8YesDVIS++: Improved Decoupled Framework for Univers...2023-12-20Code
3DVIS(Swin-L, Offline)19.4NoDVIS: Decoupled Video Instance Segmentation Fram...2023-06-06Code
4DVIS(Swin-L, Online)19.4NoDVIS: Decoupled Video Instance Segmentation Fram...2023-06-06Code
5NOVIS (Swin-L)19.4YesNOVIS: A Case for End-to-End Near-Online Video I...2023-08-29-
6GRAtt-VIS (Swin-L)19.2YesGRAtt-VIS: Gated Residual Attention for Auto Rec...2023-05-26Code
7RefineVIS (Swin-L, offline)19.1YesRefineVIS: Video Instance Segmentation with Temp...2023-06-07-
8GenVIS (Swin-L)18.9YesA Generalized Framework for Video Instance Segme...2022-11-16Code
9ROVIS (Swin-L)18.4NoRobust Online Video Instance Segmentation with T...2022-11-16Code
10MDQE(SwinL)18.3NoMDQE: Mining Discriminative Query Embeddings to ...2023-03-25Code
11MinVIS (Swin-L)18.1NoMinVIS: A Minimal Video Instance Segmentation Fr...2022-08-03Code
12TarViS (Swin-L)18YesTarViS: A Unified Approach for Target-based Vide...2023-01-06Code
13IDOL (Swin-L)17.9NoIn Defense of Online Models for Video Instance S...2022-07-21Code
14DVIS++(R50, Offline)16.8YesDVIS++: Improved Decoupled Framework for Univers...2023-12-20Code
15GRAtt-VIS (ResNet-50)16.8YesGRAtt-VIS: Gated Residual Attention for Auto Rec...2023-05-26Code
16DeVIS (Swin-L)16.6NoDeVIS: Making Deformable Transformers Work for V...2022-07-22Code
17TarViS (Swin-T)16.1YesTarViS: A Unified Approach for Target-based Vide...2023-01-06Code
18TarViS (ResNet-50)15.9YesTarViS: A Unified Approach for Target-based Vide...2023-01-06Code
19DVIS++(R50, Online)15.8YesDVIS++: Improved Decoupled Framework for Univers...2023-12-20Code
20NOVIS (ResNet-50)15.7YesNOVIS: A Case for End-to-End Near-Online Video I...2023-08-29-
21Tube-Link(ResNet-50)15.5NoTube-Link: A Flexible Cross Tube Framework for U...2023-03-22Code
22IDOL (ResNet-50)15NoIn Defense of Online Models for Video Instance S...2022-07-21Code
23VITA (Swin-L)14.9YesVITA: Video Instance Segmentation via Object Tok...2022-06-09Code
24InstanceFormer (Swin-L)12.9YesInstanceFormer: An Online Video Instance Segment...2022-08-22Code
25DeVIS (ResNet-50)12NoDeVIS: Making Deformable Transformers Work for V...2022-07-22Code
26InstanceFormer(ResNet-50)12YesInstanceFormer: An Online Video Instance Segment...2022-08-22Code
27Mask2Former-VIS9.9NoMask2Former for Video Instance Segmentation2021-12-20Code
28STMask(R101-DCN-FPN)8.4NoSpatial Feature Calibration and Temporal Fusion ...2021-04-06Code