Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Video Instance Segmentation
/
OVIS validation
Video Instance Segmentation on OVIS validation
Metric: AR10 (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
#
Model
↕
AR10
▼
Extra Data
Paper
Date
↕
Code
1
CAVIS(VIT-L, Offline)
61.8
Yes
Context-Aware Video Instance Segmentation
2024-07-03
Code
2
DVIS(Swin-L, Offline)
55.3
No
DVIS: Decoupled Video Instance Segmentation Fram...
2023-06-06
Code
3
DVIS++(VIT-L, Online)
54.6
Yes
DVIS++: Improved Decoupled Framework for Univers...
2023-12-20
Code
4
DVIS(Swin-L, Online)
52.5
No
DVIS: Decoupled Video Instance Segmentation Fram...
2023-06-06
Code
5
RefineVIS (Swin-L, offline)
51.2
Yes
RefineVIS: Video Instance Segmentation with Temp...
2023-06-07
-
6
TarViS (Swin-L)
50.4
Yes
TarViS: A Unified Approach for Target-based Vide...
2023-01-06
Code
7
IDOL (Swin-L)
49.6
No
In Defense of Online Models for Video Instance S...
2022-07-21
Code
8
GRAtt-VIS (Swin-L)
49.4
Yes
GRAtt-VIS: Gated Residual Attention for Auto Rec...
2023-05-26
Code
9
ROVIS (Swin-L)
49.1
No
Robust Online Video Instance Segmentation with T...
2022-11-16
Code
10
GenVIS (Swin-L)
49
Yes
A Generalized Framework for Video Instance Segme...
2022-11-16
Code
11
DVIS++(R50, Offline)
47.3
Yes
DVIS++: Improved Decoupled Framework for Univers...
2023-12-20
Code
12
NOVIS (Swin-L)
46.9
Yes
NOVIS: A Case for End-to-End Near-Online Video I...
2023-08-29
-
13
MDQE(SwinL)
46.5
No
MDQE: Mining Discriminative Query Embeddings to ...
2023-03-25
Code
14
MinVIS (Swin-L)
43.3
No
MinVIS: A Minimal Video Instance Segmentation Fr...
2022-08-03
Code
15
DVIS++(R50, Online)
42.9
Yes
DVIS++: Improved Decoupled Framework for Univers...
2023-12-20
Code
16
TarViS (Swin-T)
40.9
Yes
TarViS: A Unified Approach for Target-based Vide...
2023-01-06
Code
17
GRAtt-VIS (ResNet-50)
40.1
Yes
GRAtt-VIS: Gated Residual Attention for Auto Rec...
2023-05-26
Code
18
TarViS (ResNet-50)
39.9
Yes
TarViS: A Unified Approach for Target-based Vide...
2023-01-06
Code
19
DeVIS (Swin-L)
39.8
No
DeVIS: Making Deformable Transformers Work for V...
2022-07-22
Code
20
IDOL (ResNet-50)
37.5
No
In Defense of Online Models for Video Instance S...
2022-07-21
Code
21
NOVIS (ResNet-50)
37.1
Yes
NOVIS: A Case for End-to-End Near-Online Video I...
2023-08-29
-
22
Tube-Link(ResNet-50)
34.5
No
Tube-Link: A Flexible Cross Tube Framework for U...
2023-03-22
Code
23
VITA (Swin-L)
33
Yes
VITA: Video Instance Segmentation via Object Tok...
2022-06-09
Code
24
InstanceFormer (Swin-L)
29.3
Yes
InstanceFormer: An Online Video Instance Segment...
2022-08-22
Code
25
DeVIS (ResNet-50)
28.9
No
DeVIS: Making Deformable Transformers Work for V...
2022-07-22
Code
26
InstanceFormer(ResNet-50)
27.1
Yes
InstanceFormer: An Online Video Instance Segment...
2022-08-22
Code
27
Mask2Former-VIS
24.7
No
Mask2Former for Video Instance Segmentation
2021-12-20
Code
28
STMask(R101-DCN-FPN)
23.1
No
Spatial Feature Calibration and Temporal Fusion ...
2021-04-06
Code