Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Semi-Supervised Video Object Segmentation
/
DAVIS 2017 (val)
Semi-Supervised Video Object Segmentation on DAVIS 2017 (val)
Metric: F-measure (Mean) (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
F-measure (Mean) (best first)
F-measure (Mean) (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
F-measure (Mean)
▼
Extra Data
Paper
Date
↕
Code
1
Cutie+ (base)
93.4
Yes
Putting the Object Back into Video Object Segmen...
2023-10-19
Code
2
ISVOS (BL30K, MS)
93
Yes
Look Before You Match: Instance Understanding Ma...
2022-12-13
-
3
XMem (BL30K, MS)
92.6
Yes
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
4
ISVOS (BL30K)
91.9
Yes
Look Before You Match: Instance Understanding Ma...
2022-12-13
-
5
XMem (BL30K)
91.4
Yes
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
6
Cutie (base)
91.1
No
Putting the Object Back into Video Object Segmen...
2023-10-19
Code
7
XMem (MS)
91
Yes
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
8
JIMD
91
No
Memory Matching is not Enough: Jointly Improving...
2024-09-22
-
9
DEVA
91
Yes
Tracking Anything with Decoupled Video Segmentat...
2023-09-07
Code
10
Cutie+ (base, MEGA)
90.8
Yes
Putting the Object Back into Video Object Segmen...
2023-10-19
Code
11
SwinB-AOTv2-L (MS)
89.8
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
12
SwinB-AOST (L'=3, MS)
89.5
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
13
XMem
89.5
Yes
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
14
SwinB-AOTv2-L
89.4
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
15
RAVOS
89.3
Yes
Region Aware Video Object Segmentation with Deep...
2022-07-21
-
16
SwinB-DeAOT-L
89.2
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
17
MobileVOS (BL30K)
88.9
Yes
MobileVOS: Real-Time Video Object Segmentation C...
2023-03-14
-
18
QDMN
88.6
Yes
Learning Quality-aware Dynamic Memory for Video ...
2022-07-16
Code
19
STCN
88.6
Yes
Rethinking Space-Time Networks with Improved Mem...
2021-06-09
Code
20
R50-AOST (L'=3)
88.5
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
21
TarViS
88.5
Yes
TarViS: A Unified Approach for Target-based Vide...
2023-01-06
Code
22
SwinB-AOT-L
88.4
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
23
R50-DeAOT-L
88.2
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
24
R50-AOST (L'=2)
88
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
25
XMem (DAVIS and YouTubeVOS only)
87.6
Yes
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
26
R50-AOT-L
87.5
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
27
HMMN
87.5
Yes
Hierarchical Memory Matching Network for Video O...
2021-09-23
Code
28
MiVOS
87.4
Yes
Modular Interactive Video Object Segmentation: I...
2021-03-14
Code
29
DeAOT-L
87.1
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
30
MobileVOS
87.1
No
MobileVOS: Real-Time Video Object Segmentation C...
2023-03-14
-
31
AOT-L
86.4
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
32
R50-AOST (L'=1)
86.1
No
Scalable Video Object Segmentation with Identifi...
2022-03-22
Code
33
RPCMVOS
86
No
Reliable Propagation-Correction Modulation for V...
2021-12-06
Code
34
RMNet
86
No
Efficient Regional Memory Network for Video Obje...
2021-03-24
Code
35
CFBI+
85.7
No
Collaborative Video Object Segmentation by Multi...
2020-10-13
Code
36
KMN
85.6
No
Kernelized Memory Network for Video Object Segme...
2020-07-16
Code
37
AOT-B
85.2
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
38
DeAOT-B
85.1
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
39
CFBI
84.6
No
Collaborative Video Object Segmentation by Foreg...
2020-03-18
Code
40
STM
84.3
Yes
Video Object Segmentation using Space-Time Memor...
2019-04-01
Code
41
AOT-S
83.9
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
42
DeAOT-S
83.8
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
43
DeAOT-T
83.3
No
Decoupling Features in Hierarchical Propagation ...
2022-10-18
Code
44
AOT-T
82.3
No
Associating Objects with Transformers for Video ...
2021-06-04
Code
45
PReMVOS
81.8
No
PReMVOS: Proposal-generation, Refinement and Mer...
2018-07-24
Code
46
JOINT
81.2
No
Joint Inductive and Transductive Learning for Vi...
2021-08-08
Code
47
LSMVOS
80.8
No
LSMVOS: Long-Short-Term Similarity Matching for ...
2020-09-02
Code
48
e-OSVOS
80
Yes
Make One-Shot Video Object Segmentation Efficien...
2020-12-03
Code
49
SSM-VOS
79.9
No
-
-
Code
50
SWEM
79.8
No
SWEM: Towards Real-Time Video Object Segmentatio...
2022-08-22
Code
51
XMem (DAVIS only)
79.3
No
XMem: Long-Term Video Object Segmentation with a...
2022-07-14
Code
52
MHP-VOS
78.9
No
MHP-VOS: Multiple Hypotheses Propagation for Vid...
2019-04-17
Code
53
PTSNet
77.7
No
Proposal, Tracking and Segmentation (PTS): A Cas...
2019-07-02
Code
54
AFB-URR
76.1
No
Video Object Segmentation with Adaptive Feature ...
2020-10-15
Code
55
Siam R-CNN
75
No
Siam R-CNN: Visual Tracking by Re-Detection
2019-11-28
Code
56
TVOS
74.7
No
A Transductive Approach for Video Object Segment...
2020-04-15
Code
57
CINM
74
No
CNN in MRF: Video Object Segmentation via Infere...
2018-03-26
-
58
AGAME
73.6
No
A Generative Appearance Model for End-to-end Vid...
2018-11-28
Code
59
Araslanov et al.
71.7
No
Dense Unsupervised Learning for Video Segmentation
2021-11-11
Code
60
OSVOS-S
71.3
No
Video Object Segmentation Without Temporal Infor...
2017-09-18
-
61
MAMP
71.2
No
Self-Supervised Video Object Segmentation by Mot...
2021-07-27
Code
62
AGSS-VOS
69.8
No
-
-
Code
63
OnAVOS
69.1
No
Online Adaptation of Convolutional Neural Networ...
2017-06-28
-
64
RGMP
68.6
No
-
-
Code
65
RANet
68.2
No
RANet: Ranking Attention Network for Fast Video ...
2019-08-19
Code
66
VideoMatch
68.2
No
VideoMatch: Matching based Video Object Segmenta...
2018-09-04
-
67
MAST
67.6
No
MAST: A Memory-Augmented Self-supervised Tracker
2020-02-18
Code
68
Spatiotemporal CNN
64.6
No
Spatiotemporal CNN for Video Object Segmentation
2019-04-04
Code
69
OSVOS
63.9
No
One-Shot Video Object Segmentation
2016-11-16
Code
70
RVOS
63.6
No
RVOS: End-to-End Recurrent Network for Video Obj...
2019-03-13
Code
71
VOSwL
63.5
No
Video Object Segmentation with Language Referrin...
2018-03-21
-
72
FAVOS
61.8
No
Fast and Accurate Online Video Object Segmentati...
2018-06-06
Code
73
UVC
61.3
No
Joint-task Self-supervised Learning for Temporal...
2019-09-26
Code
74
SiamMask
58.5
No
Fast Online Object Tracking and Segmentation: A ...
2018-12-12
Code
75
MuG-W
58
No
Learning Video Object Segmentation from Unlabele...
2020-03-10
Code
76
OSMN
57.1
No
Efficient Video Object Segmentation via Network ...
2018-02-04
Code
77
CorrFlow
52.2
No
Self-supervised Learning for Video Correspondenc...
2019-05-02
Code
78
CycleTime
50
No
Learning Correspondence from the Cycle-Consisten...
2019-03-18
Code
#1
Cutie+ (base)
SOTA
93.4
F-measure (Mean)
· Extra Data
· 2023-10-19
Putting the Object Back into Video Object Segmentation
Code
#2
ISVOS (BL30K, MS)
SOTA
93
F-measure (Mean)
· Extra Data
· 2022-12-13
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
#3
XMem (BL30K, MS)
SOTA
92.6
F-measure (Mean)
· Extra Data
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#4
ISVOS (BL30K)
91.9
F-measure (Mean)
· Extra Data
· 2022-12-13
Look Before You Match: Instance Understanding Matters in Video Object Segmentation
#5
XMem (BL30K)
91.4
F-measure (Mean)
· Extra Data
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#6
Cutie (base)
91.1
F-measure (Mean)
· 2023-10-19
Putting the Object Back into Video Object Segmentation
Code
#7
XMem (MS)
91
F-measure (Mean)
· Extra Data
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#8
JIMD
91
F-measure (Mean)
· 2024-09-22
Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation
#9
DEVA
91
F-measure (Mean)
· Extra Data
· 2023-09-07
Tracking Anything with Decoupled Video Segmentation
Code
#10
Cutie+ (base, MEGA)
90.8
F-measure (Mean)
· Extra Data
· 2023-10-19
Putting the Object Back into Video Object Segmentation
Code
#11
SwinB-AOTv2-L (MS)
SOTA
89.8
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#12
SwinB-AOST (L'=3, MS)
89.5
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#13
XMem
89.5
F-measure (Mean)
· Extra Data
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#14
SwinB-AOTv2-L
89.4
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#15
RAVOS
89.3
F-measure (Mean)
· Extra Data
· 2022-07-21
Region Aware Video Object Segmentation with Deep Motion Modeling
#16
SwinB-DeAOT-L
89.2
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#17
MobileVOS (BL30K)
88.9
F-measure (Mean)
· Extra Data
· 2023-03-14
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation
#18
QDMN
88.6
F-measure (Mean)
· Extra Data
· 2022-07-16
Learning Quality-aware Dynamic Memory for Video Object Segmentation
Code
#19
STCN
SOTA
88.6
F-measure (Mean)
· Extra Data
· 2021-06-09
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation
Code
#20
R50-AOST (L'=3)
88.5
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#21
TarViS
88.5
F-measure (Mean)
· Extra Data
· 2023-01-06
TarViS: A Unified Approach for Target-based Video Segmentation
Code
#22
SwinB-AOT-L
SOTA
88.4
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#23
R50-DeAOT-L
88.2
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#24
R50-AOST (L'=2)
88
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#25
XMem (DAVIS and YouTubeVOS only)
87.6
F-measure (Mean)
· Extra Data
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#26
R50-AOT-L
87.5
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#27
HMMN
87.5
F-measure (Mean)
· Extra Data
· 2021-09-23
Hierarchical Memory Matching Network for Video Object Segmentation
Code
#28
MiVOS
SOTA
87.4
F-measure (Mean)
· Extra Data
· 2021-03-14
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion
Code
#29
DeAOT-L
87.1
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#30
MobileVOS
87.1
F-measure (Mean)
· 2023-03-14
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation
#31
AOT-L
86.4
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#32
R50-AOST (L'=1)
86.1
F-measure (Mean)
· 2022-03-22
Scalable Video Object Segmentation with Identification Mechanism
Code
#33
RPCMVOS
86
F-measure (Mean)
· 2021-12-06
Reliable Propagation-Correction Modulation for Video Object Segmentation
Code
#34
RMNet
86
F-measure (Mean)
· 2021-03-24
Efficient Regional Memory Network for Video Object Segmentation
Code
#35
CFBI+
SOTA
85.7
F-measure (Mean)
· 2020-10-13
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration
Code
#36
KMN
SOTA
85.6
F-measure (Mean)
· 2020-07-16
Kernelized Memory Network for Video Object Segmentation
Code
#37
AOT-B
85.2
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#38
DeAOT-B
85.1
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#39
CFBI
SOTA
84.6
F-measure (Mean)
· 2020-03-18
Collaborative Video Object Segmentation by Foreground-Background Integration
Code
#40
STM
SOTA
84.3
F-measure (Mean)
· Extra Data
· 2019-04-01
Video Object Segmentation using Space-Time Memory Networks
Code
#41
AOT-S
83.9
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#42
DeAOT-S
83.8
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#43
DeAOT-T
83.3
F-measure (Mean)
· 2022-10-18
Decoupling Features in Hierarchical Propagation for Video Object Segmentation
Code
#44
AOT-T
82.3
F-measure (Mean)
· 2021-06-04
Associating Objects with Transformers for Video Object Segmentation
Code
#45
PReMVOS
SOTA
81.8
F-measure (Mean)
· 2018-07-24
PReMVOS: Proposal-generation, Refinement and Merging for Video Object Segmentation
Code
#46
JOINT
81.2
F-measure (Mean)
· 2021-08-08
Joint Inductive and Transductive Learning for Video Object Segmentation
Code
#47
LSMVOS
80.8
F-measure (Mean)
· 2020-09-02
LSMVOS: Long-Short-Term Similarity Matching for Video Object
Code
#48
e-OSVOS
80
F-measure (Mean)
· Extra Data
· 2020-12-03
Make One-Shot Video Object Segmentation Efficient Again
Code
#49
SSM-VOS
79.9
F-measure (Mean)
No paper
Code
#50
SWEM
79.8
F-measure (Mean)
· 2022-08-22
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-Maximization
Code
#51
XMem (DAVIS only)
79.3
F-measure (Mean)
· 2022-07-14
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
Code
#52
MHP-VOS
78.9
F-measure (Mean)
· 2019-04-17
MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation
Code
#53
PTSNet
77.7
F-measure (Mean)
· 2019-07-02
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation
Code
#54
AFB-URR
76.1
F-measure (Mean)
· 2020-10-15
Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement
Code
#55
Siam R-CNN
75
F-measure (Mean)
· 2019-11-28
Siam R-CNN: Visual Tracking by Re-Detection
Code
#56
TVOS
74.7
F-measure (Mean)
· 2020-04-15
A Transductive Approach for Video Object Segmentation
Code
#57
CINM
SOTA
74
F-measure (Mean)
· 2018-03-26
CNN in MRF: Video Object Segmentation via Inference in A CNN-Based Higher-Order Spatio-Temporal MRF
#58
AGAME
73.6
F-measure (Mean)
· 2018-11-28
A Generative Appearance Model for End-to-end Video Object Segmentation
Code
#59
Araslanov et al.
71.7
F-measure (Mean)
· 2021-11-11
Dense Unsupervised Learning for Video Segmentation
Code
#60
OSVOS-S
SOTA
71.3
F-measure (Mean)
· 2017-09-18
Video Object Segmentation Without Temporal Information
#61
MAMP
71.2
F-measure (Mean)
· 2021-07-27
Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation
Code
#62
AGSS-VOS
69.8
F-measure (Mean)
No paper
Code
#63
OnAVOS
SOTA
69.1
F-measure (Mean)
· 2017-06-28
Online Adaptation of Convolutional Neural Networks for Video Object Segmentation
#64
RGMP
68.6
F-measure (Mean)
No paper
Code
#65
RANet
68.2
F-measure (Mean)
· 2019-08-19
RANet: Ranking Attention Network for Fast Video Object Segmentation
Code
#66
VideoMatch
68.2
F-measure (Mean)
· 2018-09-04
VideoMatch: Matching based Video Object Segmentation
#67
MAST
67.6
F-measure (Mean)
· 2020-02-18
MAST: A Memory-Augmented Self-supervised Tracker
Code
#68
Spatiotemporal CNN
64.6
F-measure (Mean)
· 2019-04-04
Spatiotemporal CNN for Video Object Segmentation
Code
#69
OSVOS
SOTA
63.9
F-measure (Mean)
· 2016-11-16
One-Shot Video Object Segmentation
Code
#70
RVOS
63.6
F-measure (Mean)
· 2019-03-13
RVOS: End-to-End Recurrent Network for Video Object Segmentation
Code
#71
VOSwL
63.5
F-measure (Mean)
· 2018-03-21
Video Object Segmentation with Language Referring Expressions
#72
FAVOS
61.8
F-measure (Mean)
· 2018-06-06
Fast and Accurate Online Video Object Segmentation via Tracking Parts
Code
#73
UVC
61.3
F-measure (Mean)
· 2019-09-26
Joint-task Self-supervised Learning for Temporal Correspondence
Code
#74
SiamMask
58.5
F-measure (Mean)
· 2018-12-12
Fast Online Object Tracking and Segmentation: A Unifying Approach
Code
#75
MuG-W
58
F-measure (Mean)
· 2020-03-10
Learning Video Object Segmentation from Unlabeled Videos
Code
#76
OSMN
57.1
F-measure (Mean)
· 2018-02-04
Efficient Video Object Segmentation via Network Modulation
Code
#77
CorrFlow
52.2
F-measure (Mean)
· 2019-05-02
Self-supervised Learning for Video Correspondence Flow
Code
#78
CycleTime
50
F-measure (Mean)
· 2019-03-18
Learning Correspondence from the Cycle-Consistency of Time
Code