Video Instance Segmentation on OVIS validation
Metric: AP50 (higher is better)
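For readers unfamiliar with the metric, the following is a minimal sketch of the matching criterion behind AP50, assuming the YouTube-VIS-style protocol in which IoU is accumulated over every frame of a video and a prediction counts as a match when that spatio-temporal IoU is at least 0.5. The names `spatiotemporal_iou` and `is_match_at_50` are hypothetical and are not taken from any official evaluation toolkit; the full metric also involves ranking predictions by confidence and averaging precision over recall levels and categories, which is omitted here.

```python
from typing import List, Optional

Mask = List[List[int]]  # one binary mask per frame, as rows of 0/1


def _pixels(mask: Optional[Mask]) -> List[int]:
    """Flatten a frame mask; a missing (None) frame yields an empty list."""
    return [] if mask is None else [v for row in mask for v in row]


def spatiotemporal_iou(pred: List[Optional[Mask]], gt: List[Optional[Mask]]) -> float:
    """IoU of two mask tracks, with intersection and union summed over all frames.

    A frame in which the object is absent is passed as None and treated as empty.
    """
    if len(pred) != len(gt):
        raise ValueError("tracks must cover the same number of frames")
    inter = 0
    union = 0
    for p_mask, g_mask in zip(pred, gt):
        p, g = _pixels(p_mask), _pixels(g_mask)
        if not p or not g:
            # Only one track (or neither) is present in this frame:
            # nothing intersects, and the present mask adds to the union.
            union += sum(1 for v in p if v) + sum(1 for v in g if v)
            continue
        inter += sum(1 for a, b in zip(p, g) if a and b)
        union += sum(1 for a, b in zip(p, g) if a or b)
    return inter / union if union else 0.0


def is_match_at_50(pred: List[Optional[Mask]], gt: List[Optional[Mask]]) -> bool:
    """Assumed AP50 matching rule: spatio-temporal IoU of at least 0.5."""
    return spatiotemporal_iou(pred, gt) >= 0.5


# Two-frame toy example: the prediction over-segments the first frame.
pred_track = [[[1, 1], [0, 0]], [[1, 0], [0, 0]]]
gt_track = [[[1, 0], [0, 0]], [[1, 0], [0, 0]]]
iou = spatiotemporal_iou(pred_track, gt_track)  # 2 shared pixels / 3 in the union
```

Because the sums run over the whole video, a track that is lost during a heavy occlusion is penalised even if its per-frame masks are accurate elsewhere.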
Results
| # | Model | AP50 | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | DVIS-DAQ(VIT-L, Offline) | 83.8 | Yes | DVIS-DAQ: Improving Video Segmentation via Dynam... | 2024-03-29 | Code |
| 2 | CAVIS(VIT-L, Offline) | 82.6 | Yes | Context-Aware Video Instance Segmentation | 2024-07-03 | Code |
| 3 | DVIS++(VIT-L,Offline) | 78.9 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 4 | DVIS(Swin-L, Offline) | 75.9 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 5 | DVIS++(VIT-L, Online) | 72.5 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 6 | UNINEXT (ViT-H, Online) | 72.5 | Yes | Universal Instance Perception as Object Discover... | 2023-03-12 | Code |
| 7 | DVIS(Swin-L, Online) | 71.9 | No | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 8 | CTVIS (Swin-L) | 71.5 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 9 | RefineVIS (Swin-L, offline) | 70.4 | Yes | RefineVIS: Video Instance Segmentation with Temp... | 2023-06-07 | - |
| 10 | GenVIS (Swin-L) | 69.2 | Yes | A Generalized Framework for Video Instance Segme... | 2022-11-16 | Code |
| 11 | GRAtt-VIS (Swin-L) | 69.1 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 12 | DVIS++(R50, Offline) | 68.9 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 13 | BoxVIS(Swin-L & Box-sup) | 68.4 | No | BoxVIS: Video Instance Segmentation with Box Ann... | 2023-03-26 | Code |
| 14 | NOVIS (Swin-L) | 68.3 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 15 | TarViS (Swin-L) | 67.8 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 16 | MDQE(SwinL) | 67.8 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 17 | IDOL (Swin-L) | 65.7 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 18 | ROVIS (Swin-L) | 64.7 | No | Robust Online Video Instance Segmentation with T... | 2022-11-16 | Code |
| 19 | DVIS++(R50, Online) | 62.8 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 20 | MinVIS (Swin-L) | 61.5 | No | MinVIS: A Minimal Video Instance Segmentation Fr... | 2022-08-03 | Code |
| 21 | GRAtt-VIS (ResNet-50) | 60.8 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 22 | CTVIS (ResNet-50) | 60.8 | Yes | CTVIS: Consistent Training for Online Video Inst... | 2023-07-24 | Code |
| 23 | DeVIS (Swin-L) | 59.3 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 24 | NOVIS (ResNet-50) | 56.2 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 25 | UNINEXT (ResNet-50, Online) | 55.5 | Yes | Universal Instance Perception as Object Discover... | 2023-03-12 | Code |
| 26 | TarViS (Swin-T) | 55 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 27 | TarViS (ResNet-50) | 52.5 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 28 | VITA (Swin-L) | 51.9 | Yes | VITA: Video Instance Segmentation via Object Tok... | 2022-06-09 | Code |
| 29 | Tube-Link(ResNet-50) | 51.5 | No | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 30 | IDOL (ResNet-50) | 51.3 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 31 | DeVIS (ResNet-50) | 47.6 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 32 | InstanceFormer (Swin-L) | 42.5 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 33 | InstanceFormer(ResNet-50) | 40.7 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 34 | Mask2Former-VIS | 36.9 | No | Mask2Former for Video Instance Segmentation | 2021-12-20 | Code |
| 35 | CrossVIS (ResNet-50, calibration) | 35.5 | No | Crossover Learning for Fast Online Video Instanc... | 2021-04-13 | Code |
| 36 | STMask(R101-DCN-FPN) | 35.4 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |
| 37 | TeViT (ResNet-50) | 34.9 | No | Temporally Efficient Vision Transformer for Vide... | 2022-04-18 | Code |
| 38 | CMaskTrack R-CNN (ResNet-50) | 33.9 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |
| 39 | D2Conv3D (ResNet-50) | 33.8 | No | - | - | Code |
| 40 | STC (ResNet-50) | 33.5 | No | STC: Spatio-Temporal Contrastive Learning for Vi... | 2022-02-08 | - |
| 41 | CrossVIS (ResNet-50) | 32.7 | No | Crossover Learning for Fast Online Video Instanc... | 2021-04-13 | Code |
| 42 | CSipMask (ResNet-50) | 29.9 | No | Occluded Video Instance Segmentation: A Benchmark | 2021-02-02 | Code |