Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Instance Segmentation on OVIS validation

Metric: AR10 (higher is better)

Results

| # | Model | AR10 | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | CAVIS(VIT-L, Offline) | 61.8 | Yes | Context-Aware Video Instance Segmentation | 2024-07-03 | Code |
| 2 | DVIS(Swin-L, Offline) | 55.3 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 3 | DVIS++(VIT-L, Online) | 54.6 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 4 | DVIS(Swin-L, Online) | 52.5 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 5 | RefineVIS (Swin-L, offline) | 51.2 | Yes | RefineVIS: Video Instance Segmentation with Temp... | 2023-06-07 | - |
| 6 | TarViS (Swin-L) | 50.4 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 7 | IDOL (Swin-L) | 49.6 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 8 | GRAtt-VIS (Swin-L) | 49.4 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 9 | ROVIS (Swin-L) | 49.1 | No | Robust Online Video Instance Segmentation with T... | 2022-11-16 | Code |
| 10 | GenVIS (Swin-L) | 49 | Yes | A Generalized Framework for Video Instance Segme... | 2022-11-16 | Code |
| 11 | DVIS++(R50, Offline) | 47.3 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 12 | NOVIS (Swin-L) | 46.9 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 13 | MDQE(SwinL) | 46.5 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 14 | MinVIS (Swin-L) | 43.3 | No | MinVIS: A Minimal Video Instance Segmentation Fr... | 2022-08-03 | Code |
| 15 | DVIS++(R50, Online) | 42.9 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 16 | TarViS (Swin-T) | 40.9 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 17 | GRAtt-VIS (ResNet-50) | 40.1 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 18 | TarViS (ResNet-50) | 39.9 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 19 | DeVIS (Swin-L) | 39.8 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 20 | IDOL (ResNet-50) | 37.5 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 21 | NOVIS (ResNet-50) | 37.1 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 22 | Tube-Link(ResNet-50) | 34.5 | No | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 23 | VITA (Swin-L) | 33 | Yes | VITA: Video Instance Segmentation via Object Tok... | 2022-06-09 | Code |
| 24 | InstanceFormer (Swin-L) | 29.3 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 25 | DeVIS (ResNet-50) | 28.9 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 26 | InstanceFormer(ResNet-50) | 27.1 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 27 | Mask2Former-VIS | 24.7 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 28 | STMask(R101-DCN-FPN) | 23.1 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |