| 1 | Cutie+ (base) | 93.4 | Yes | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 2 | ISVOS (BL30K, MS) | 93 | Yes | Look Before You Match: Instance Understanding Ma... | 2022-12-13 | - |
| 3 | XMem (BL30K, MS) | 92.6 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 4 | ISVOS (BL30K) | 91.9 | Yes | Look Before You Match: Instance Understanding Ma... | 2022-12-13 | - |
| 5 | XMem (BL30K) | 91.4 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 6 | Cutie (base) | 91.1 | No | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 7 | XMem (MS) | 91 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 8 | JIMD | 91 | No | Memory Matching is not Enough: Jointly Improving... | 2024-09-22 | - |
| 9 | DEVA | 91 | Yes | Tracking Anything with Decoupled Video Segmentat... | 2023-09-07 | Code |
| 10 | Cutie+ (base, MEGA) | 90.8 | Yes | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 11 | SwinB-AOTv2-L (MS) | 89.8 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 12 | SwinB-AOST (L'=3, MS) | 89.5 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 13 | XMem | 89.5 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 14 | SwinB-AOTv2-L | 89.4 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 15 | RAVOS | 89.3 | Yes | Region Aware Video Object Segmentation with Deep... | 2022-07-21 | - |
| 16 | SwinB-DeAOT-L | 89.2 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 17 | MobileVOS (BL30K) | 88.9 | Yes | MobileVOS: Real-Time Video Object Segmentation C... | 2023-03-14 | - |
| 18 | QDMN | 88.6 | Yes | Learning Quality-aware Dynamic Memory for Video ... | 2022-07-16 | Code |
| 19 | STCN | 88.6 | Yes | Rethinking Space-Time Networks with Improved Mem... | 2021-06-09 | Code |
| 20 | R50-AOST (L'=3) | 88.5 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 21 | TarViS | 88.5 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 22 | SwinB-AOT-L | 88.4 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 23 | R50-DeAOT-L | 88.2 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 24 | R50-AOST (L'=2) | 88 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 25 | XMem (DAVIS and YouTubeVOS only) | 87.6 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 26 | R50-AOT-L | 87.5 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 27 | HMMN | 87.5 | Yes | Hierarchical Memory Matching Network for Video O... | 2021-09-23 | Code |
| 28 | MiVOS | 87.4 | Yes | Modular Interactive Video Object Segmentation: I... | 2021-03-14 | Code |
| 29 | DeAOT-L | 87.1 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 30 | MobileVOS | 87.1 | No | MobileVOS: Real-Time Video Object Segmentation C... | 2023-03-14 | - |
| 31 | AOT-L | 86.4 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 32 | R50-AOST (L'=1) | 86.1 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 33 | RPCMVOS | 86 | No | Reliable Propagation-Correction Modulation for V... | 2021-12-06 | Code |
| 34 | RMNet | 86 | No | Efficient Regional Memory Network for Video Obje... | 2021-03-24 | Code |
| 35 | CFBI+ | 85.7 | No | Collaborative Video Object Segmentation by Multi... | 2020-10-13 | Code |
| 36 | KMN | 85.6 | No | Kernelized Memory Network for Video Object Segme... | 2020-07-16 | Code |
| 37 | AOT-B | 85.2 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 38 | DeAOT-B | 85.1 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 39 | CFBI | 84.6 | No | Collaborative Video Object Segmentation by Foreg... | 2020-03-18 | Code |
| 40 | STM | 84.3 | Yes | Video Object Segmentation using Space-Time Memor... | 2019-04-01 | Code |
| 41 | AOT-S | 83.9 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 42 | DeAOT-S | 83.8 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 43 | DeAOT-T | 83.3 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 44 | AOT-T | 82.3 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 45 | PReMVOS | 81.8 | No | PReMVOS: Proposal-generation, Refinement and Mer... | 2018-07-24 | Code |
| 46 | JOINT | 81.2 | No | Joint Inductive and Transductive Learning for Vi... | 2021-08-08 | Code |
| 47 | LSMVOS | 80.8 | No | LSMVOS: Long-Short-Term Similarity Matching for ... | 2020-09-02 | Code |
| 48 | e-OSVOS | 80 | Yes | Make One-Shot Video Object Segmentation Efficien... | 2020-12-03 | Code |
| 49 | SSM-VOS | 79.9 | No | - | - | Code |
| 50 | SWEM | 79.8 | No | SWEM: Towards Real-Time Video Object Segmentatio... | 2022-08-22 | Code |
| 51 | XMem (DAVIS only) | 79.3 | No | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 52 | MHP-VOS | 78.9 | No | MHP-VOS: Multiple Hypotheses Propagation for Vid... | 2019-04-17 | Code |
| 53 | PTSNet | 77.7 | No | Proposal, Tracking and Segmentation (PTS): A Cas... | 2019-07-02 | Code |
| 54 | DEVA (EntitySeg) | 76.4 | Yes | Tracking Anything with Decoupled Video Segmentat... | 2023-09-07 | Code |
| 55 | AFB-URR | 76.1 | No | Video Object Segmentation with Adaptive Feature ... | 2020-10-15 | Code |
| 56 | Siam R-CNN | 75 | No | Siam R-CNN: Visual Tracking by Re-Detection | 2019-11-28 | Code |
| 57 | TVOS | 74.7 | No | A Transductive Approach for Video Object Segment... | 2020-04-15 | Code |
| 58 | CINM | 74 | No | CNN in MRF: Video Object Segmentation via Infere... | 2018-03-26 | - |
| 59 | Propose-Reduce | 73.8 | Yes | Video Instance Segmentation with a Propose-Reduc... | 2021-03-25 | Code |
| 60 | AGAME | 73.6 | No | A Generative Appearance Model for End-to-end Vid... | 2018-11-28 | Code |
| 61 | Araslanov et al. | 71.7 | No | Dense Unsupervised Learning for Video Segmentation | 2021-11-11 | Code |
| 62 | OSVOS-S | 71.3 | No | Video Object Segmentation Without Temporal Infor... | 2017-09-18 | - |
| 63 | MAMP | 71.2 | No | Self-Supervised Video Object Segmentation by Mot... | 2021-07-27 | Code |
| 64 | AGSS-VOS | 69.8 | No | - | - | Code |
| 65 | UnOVOST | 69.3 | No | UnOVOST: Unsupervised Offline Video Object Segme... | 2020-01-15 | Code |
| 66 | OnAVOS | 69.1 | No | Online Adaptation of Convolutional Neural Networ... | 2017-06-28 | - |
| 67 | RGMP | 68.6 | No | - | - | Code |
| 68 | RANet | 68.2 | No | RANet: Ranking Attention Network for Fast Video ... | 2019-08-19 | Code |
| 69 | VideoMatch | 68.2 | No | VideoMatch: Matching based Video Object Segmenta... | 2018-09-04 | - |
| 70 | STEm-Seg | 67.8 | Yes | STEm-Seg: Spatio-temporal Embeddings for Instanc... | 2020-03-18 | Code |
| 71 | MAST | 67.6 | No | MAST: A Memory-Augmented Self-supervised Tracker | 2020-02-18 | Code |
| 72 | MAST | 67.6 | Yes | MAST: A Memory-Augmented Self-supervised Tracker | 2020-02-18 | Code |
| 73 | Spatiotemporal CNN | 64.6 | No | Spatiotemporal CNN for Video Object Segmentation | 2019-04-04 | Code |
| 74 | OSVOS | 63.9 | No | One-Shot Video Object Segmentation | 2016-11-16 | Code |
| 75 | RVOS | 63.6 | No | RVOS: End-to-End Recurrent Network for Video Obj... | 2019-03-13 | Code |
| 76 | VOSwL | 63.5 | No | Video Object Segmentation with Language Referrin... | 2018-03-21 | - |
| 77 | FAVOS | 61.8 | No | Fast and Accurate Online Video Object Segmentati... | 2018-06-06 | Code |
| 78 | UVC | 61.3 | No | Joint-task Self-supervised Learning for Temporal... | 2019-09-26 | Code |
| 79 | MATNet | 60.4 | No | - | - | Code |
| 80 | ALBA | 60.2 | No | ALBA : Reinforcement Learning for Video Object S... | 2020-05-26 | Code |
| 81 | AGS | 59.5 | Yes | - | - | Code |
| 82 | SiamMask | 58.5 | No | Fast Online Object Tracking and Segmentation: A ... | 2018-12-12 | Code |
| 83 | MuG-W | 58 | No | Learning Video Object Segmentation from Unlabele... | 2020-03-10 | Code |
| 84 | OSMN | 57.1 | No | Efficient Video Object Segmentation via Network ... | 2018-02-04 | Code |
| 85 | PDB | 57 | No | - | - | - |
| 86 | CorrFlow | 52.2 | No | Self-supervised Learning for Video Correspondenc... | 2019-05-02 | Code |
| 87 | CycleTime | 50 | No | Learning Correspondence from the Cycle-Consisten... | 2019-03-18 | Code |
| 88 | RVOS | 45.7 | No | RVOS: End-to-End Recurrent Network for Video Obj... | 2019-03-13 | Code |