Video Instance Segmentation on YouTube-VIS 2021
Metric: AR10 (higher is better)
Results
| # | Model | AR10 | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | DVIS-DAQ(VIT-L, Offline) | 70.7 | Yes | DVIS-DAQ: Improving Video Segmentation via Dynam... | 2024-03-29 | Code |
| 2 | CAVIS(VIT-L, Offline) | 70.3 | Yes | Context-Aware Video Instance Segmentation | 2024-07-03 | Code |
| 3 | DVIS++(VIT-L, Offline) | 69.5 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 4 | DVIS++(VIT-L, Online) | 68.0 | Yes | DVIS++: Improved Decoupled Framework for Univers... | 2023-12-20 | Code |
| 5 | DVIS(Swin-L) | 65.7 | Yes | DVIS: Decoupled Video Instance Segmentation Fram... | 2023-06-06 | Code |
| 6 | RefineVIS (Swin-L, online) | 65.2 | Yes | RefineVIS: Video Instance Segmentation with Temp... | 2023-06-07 | - |
| 7 | TarViS (Swin-L) | 64.8 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 8 | GenVIS (Swin-L) | 64.7 | Yes | A Generalized Framework for Video Instance Segme... | 2022-11-16 | Code |
| 9 | GRAtt-VIS (Swin-L) | 64.5 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 10 | NOVIS (Swin-L) | 64.4 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 11 | Tube-Link(Swin-L) | 63.6 | No | Tube-Link: A Flexible Cross Tube Framework for U... | 2023-03-22 | Code |
| 12 | UniVS(Swin-L) | 63.1 | Yes | UniVS: Unified and Universal Video Segmentation ... | 2024-02-28 | Code |
| 13 | VITA (Swin-L) | 62.6 | Yes | VITA: Video Instance Segmentation via Object Tok... | 2022-06-09 | Code |
| 14 | BoxVIS(Swin-L & Box-sup) | 61.0 | No | BoxVIS: Video Instance Segmentation with Box Ann... | 2023-03-26 | Code |
| 15 | MinVIS (Swin-L) | 60.8 | No | MinVIS: A Minimal Video Instance Segmentation Fr... | 2022-08-03 | Code |
| 16 | MDQE(Swin-L) | 60.6 | No | MDQE: Mining Discriminative Query Embeddings to ... | 2023-03-25 | Code |
| 17 | IDOL (Swin-L) | 60.1 | No | In Defense of Online Models for Video Instance S... | 2022-07-21 | Code |
| 18 | DeVIS (Swin-L) | 57.8 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 19 | TarViS (Swin-T) | 57.2 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 20 | InstanceFormer (Swin-L) | 56.0 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 21 | GRAtt-VIS (ResNet-50) | 56.0 | Yes | GRAtt-VIS: Gated Residual Attention for Auto Rec... | 2023-05-26 | Code |
| 22 | TarViS (ResNet-50) | 55.9 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 23 | NOVIS (ResNet-50) | 54.4 | Yes | NOVIS: A Case for End-to-End Near-Online Video I... | 2023-08-29 | - |
| 24 | DeVIS (ResNet-50) | 50.1 | No | DeVIS: Making Deformable Transformers Work for V... | 2022-07-22 | Code |
| 25 | InstanceFormer (ResNet-50) | 48.1 | Yes | InstanceFormer: An Online Video Instance Segment... | 2022-08-22 | Code |
| 26 | STMask(R101-DCN-FPN) | 39.1 | No | Spatial Feature Calibration and Temporal Fusion ... | 2021-04-06 | Code |