Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Video Instance Segmentation on OVIS validation

Metric: AP50 (higher is better)
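As a rough illustration of what the metric measures: AP50 is average precision with an IoU threshold of 0.5, and for video instance segmentation the IoU is conventionally computed over whole mask tracks rather than single frames. The sketch below shows only that matching criterion, not the full precision-recall computation or the benchmark's official evaluation code. The names `Track`, `video_iou`, and `matches_at_50` are hypothetical, and masks are represented as plain sets of pixel coordinates for simplicity.

```python
from typing import List, Set, Tuple

Pixel = Tuple[int, int]    # (row, col) of one mask pixel
Track = List[Set[Pixel]]   # one pixel set per frame; an empty set means "not visible"


def video_iou(pred: Track, gt: Track) -> float:
    """Spatio-temporal IoU: intersection and union are summed over all frames."""
    if len(pred) != len(gt):
        raise ValueError("tracks must cover the same number of frames")
    inter = sum(len(p & g) for p, g in zip(pred, gt))
    union = sum(len(p | g) for p, g in zip(pred, gt))
    return inter / union if union else 0.0


def matches_at_50(pred: Track, gt: Track) -> bool:
    """Assumed criterion: a prediction can match a ground-truth track when IoU >= 0.5."""
    return video_iou(pred, gt) >= 0.5


# Two-frame toy example: the prediction loses half of the object in frame 2.
gt_track = [{(0, 0), (0, 1)}, {(0, 0), (0, 1)}]
pred_track = [{(0, 0), (0, 1)}, {(0, 1)}]
print(video_iou(pred_track, gt_track))      # intersection 3, union 4
print(matches_at_50(pred_track, gt_track))
```

Because the intersection and union are accumulated across frames, a prediction that drops an instance during occlusion is penalized even if its visible-frame masks are accurate.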


Results

| # | Model | AP50 | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | DVIS-DAQ (VIT-L, Offline) | 83.8 | Yes | DVIS-DAQ: Improving Video Segmentation via Dynam... | 2024-03-29 | Code |
| 2 | CAVIS (VIT-L, Offline) | 82.6 | Yes | Context-Aware Video Instance Segmentation | 2024-07-03 | Code |
| 3 | DVIS++ (VIT-L, Offline) | 78.9 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 4 | DVIS (Swin-L, Offline) | 75.9 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 5 | DVIS++ (VIT-L, Online) | 72.5 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 6 | UNINEXT (ViT-H, Online) | 72.5 | Yes | Universal Instance Perception as Object Discover... | 2023-03-12 | Code |
| 7 | DVIS (Swin-L, Online) | 71.9 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 8 | CTVIS (Swin-L) | 71.5 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 9 | RefineVIS (Swin-L, offline) | 70.4 | Yes | RefineVIS: Video Instance Segmentation with Temp... | 2023-06-07 | - |
| 10 | GenVIS (Swin-L) | 69.2 | Yes | A Generalized Framework for Video Instance Segme... | 2022-11-16 | Code |
| 11 | GRAtt-VIS (Swin-L) | 69.1 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 12 | DVIS++ (R50, Offline) | 68.9 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 13 | BoxVIS (Swin-L & Box-sup) | 68.4 | No | BoxVIS: Video Instance Segmentation with Box Ann... | 2023-03-26 | Code |
| 14 | NOVIS (Swin-L) | 68.3 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 15 | TarViS (Swin-L) | 67.8 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 16 | MDQE (SwinL) | 67.8 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 17 | IDOL (Swin-L) | 65.7 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 18 | ROVIS (Swin-L) | 64.7 | No | Robust Online Video Instance Segmentation with T... | 2022-11-16 | Code |
| 19 | DVIS++ (R50, Online) | 62.8 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 20 | MinVIS (Swin-L) | 61.5 | No | MinVIS: A Minimal Video Instance Segmentation Fr... | 2022-08-03 | Code |
| 21 | GRAtt-VIS (ResNet-50) | 60.8 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 22 | CTVIS (ResNet-50) | 60.8 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 23 | DeVIS (Swin-L) | 59.3 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 24 | NOVIS (ResNet-50) | 56.2 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 25 | UNINEXT (ResNet-50, Online) | 55.5 | Yes | Universal Instance Perception as Object Discover... | 2023-03-12 | Code |
| 26 | TarViS (Swin-T) | 55 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 27 | TarViS (ResNet-50) | 52.5 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 28 | VITA (Swin-L) | 51.9 | Yes | VITA: Video Instance Segmentation via Object Tok... | 2022-06-09 | Code |
| 29 | Tube-Link (ResNet-50) | 51.5 | No | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 30 | IDOL (ResNet-50) | 51.3 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 31 | DeVIS (ResNet-50) | 47.6 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 32 | InstanceFormer (Swin-L) | 42.5 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 33 | InstanceFormer (ResNet-50) | 40.7 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 34 | Mask2Former-VIS | 36.9 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 35 | CrossVIS (ResNet-50, calibration) | 35.5 | No | Crossover Learning for Fast Online Video Instanc... | 2021-04-13 | Code |
| 36 | STMask (R101-DCN-FPN) | 35.4 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |
| 37 | TeViT (ResNet-50) | 34.9 | No | Temporally Efficient Vision Transformer for Vide... | 2022-04-18 | Code |
| 38 | CMaskTrack R-CNN (ResNet-50) | 33.9 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |
| 39 | D2Conv3D (ResNet-50) | 33.8 | No | - | - | Code |
| 40 | STC (ResNet-50) | 33.5 | No | STC: Spatio-Temporal Contrastive Learning for Vi... | 2022-02-08 | - |
| 41 | CrossVIS (ResNet-50) | 32.7 | No | Crossover Learning for Fast Online Video Instanc... | 2021-04-13 | Code |
| 42 | CSipMask (ResNet-50) | 29.9 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |