| 1 | Cutie+ (base) | 87.5 | Yes | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 2 | ISVOS (BL30K, MS) | 86.7 | Yes | Look Before You Match: Instance Understanding Ma... | 2022-12-13 | - |
| 3 | XMem (BL30K, MS) | 86.3 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 4 | ISVOS (MS) | 85.8 | Yes | Look Before You Match: Instance Understanding Ma... | 2022-12-13 | - |
| 5 | Cutie+ (base, MEGA) | 85.5 | Yes | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 6 | XMem (MS) | 85.4 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 7 | JIMD | 85.2 | No | Memory Matching is not Enough: Jointly Improving... | 2024-09-22 | - |
| 8 | Cutie (base) | 84.6 | No | Putting the Object Back into Video Object Segmen... | 2023-10-19 | Code |
| 9 | ISVOS (BL30K) | 84.5 | Yes | Look Before You Match: Instance Understanding Ma... | 2022-12-13 | - |
| 10 | DEVA | 84.2 | Yes | Tracking Anything with Decoupled Video Segmentat... | 2023-09-07 | Code |
| 11 | SwinB-AOTv2-L (MS) | 84.2 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 12 | XMem (BL30K) | 84 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 13 | SwinB-AOST (L'=3, MS) | 83.8 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 14 | SwinB-AOTv2-L | 83.1 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 15 | SwinB-DeAOT-L | 83.1 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 16 | XMem | 82.9 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 17 | RAVOS | 82.9 | Yes | Region Aware Video Object Segmentation with Deep... | 2022-07-21 | - |
| 18 | R50-AOST (L'=3) | 82.6 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 19 | QDMN | 82.5 | Yes | Learning Quality-aware Dynamic Memory for Video ... | 2022-07-16 | Code |
| 20 | R50-AOST (L'=2) | 82.5 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 21 | SwinB-AOT-L | 82.4 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 22 | R50-AOT-L | 82.3 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 23 | R50-DeAOT-L | 82.2 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 24 | STCN | 82 | Yes | Rethinking Space-Time Networks with Improved Mem... | 2021-06-09 | Code |
| 25 | HMMN | 81.9 | Yes | Hierarchical Memory Matching Network for Video O... | 2021-09-23 | Code |
| 26 | TarViS | 81.7 | Yes | TarViS: A Unified Approach for Target-based Vide... | 2023-01-06 | Code |
| 27 | MiVOS | 81.7 | Yes | Modular Interactive Video Object Segmentation: I... | 2021-03-14 | Code |
| 28 | XMem (DAVIS and YouTubeVOS only) | 81.4 | Yes | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 29 | RPCMVOS | 81.3 | No | Reliable Propagation-Correction Modulation for V... | 2021-12-06 | Code |
| 30 | R50-AOST (L'=1) | 81.2 | No | Scalable Video Object Segmentation with Identifi... | 2022-03-22 | Code |
| 31 | AOT-L | 81.1 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 32 | DeAOT-L | 81 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 33 | RMNet | 81 | No | Efficient Regional Memory Network for Video Obje... | 2021-03-24 | Code |
| 34 | CFBI+ | 80.1 | No | Collaborative Video Object Segmentation by Multi... | 2020-10-13 | Code |
| 35 | KMN | 80 | No | Kernelized Memory Network for Video Object Segme... | 2020-07-16 | Code |
| 36 | AOT-B | 79.7 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 37 | DeAOT-B | 79.2 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 38 | STM | 79.2 | Yes | Video Object Segmentation using Space-Time Memor... | 2019-04-01 | Code |
| 39 | CFBI | 79.1 | No | Collaborative Video Object Segmentation by Foreg... | 2020-03-18 | Code |
| 40 | AOT-S | 78.7 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 41 | DeAOT-S | 77.8 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 42 | DeAOT-T | 77.7 | No | Decoupling Features in Hierarchical Propagation ... | 2022-10-18 | Code |
| 43 | AOT-T | 77.4 | No | Associating Objects with Transformers for Video ... | 2021-06-04 | Code |
| 44 | JOINT | 76 | No | Joint Inductive and Transductive Learning for Vi... | 2021-08-08 | Code |
| 45 | SSM-VOS | 75.3 | No | - | - | Code |
| 46 | SWEM | 74.5 | No | SWEM: Towards Real-Time Video Object Segmentatio... | 2022-08-22 | Code |
| 47 | e-OSVOS | 74.4 | Yes | Make One-Shot Video Object Segmentation Efficien... | 2020-12-03 | Code |
| 48 | XMem (DAVIS only) | 74.1 | No | XMem: Long-Term Video Object Segmentation with a... | 2022-07-14 | Code |
| 49 | PReMVOS | 73.9 | No | PReMVOS: Proposal-generation, Refinement and Mer... | 2018-07-24 | Code |
| 50 | LSMVOS | 73.9 | No | LSMVOS: Long-Short-Term Similarity Matching for ... | 2020-09-02 | Code |
| 51 | MHP-VOS | 73.4 | No | MHP-VOS: Multiple Hypotheses Propagation for Vid... | 2019-04-17 | Code |
| 52 | AFB-URR | 73 | No | Video Object Segmentation with Adaptive Feature ... | 2020-10-15 | Code |
| 53 | PTSNet | 71.6 | No | Proposal, Tracking and Segmentation (PTS): A Cas... | 2019-07-02 | Code |
| 54 | TVOS | 69.9 | No | A Transductive Approach for Video Object Segment... | 2020-04-15 | Code |
| 55 | AGAME | 68.5 | No | A Generative Appearance Model for End-to-end Vid... | 2018-11-28 | Code |
| 56 | MAMP | 68.3 | No | Self-Supervised Video Object Segmentation by Mot... | 2021-07-27 | Code |
| 57 | CINM | 67.2 | No | CNN in MRF: Video Object Segmentation via Infere... | 2018-03-26 | - |
| 58 | Araslanov et al. | 67.1 | No | Dense Unsupervised Learning for Video Segmentation | 2021-11-11 | Code |
| 59 | Siam R-CNN | 66.1 | No | Siam R-CNN: Visual Tracking by Re-Detection | 2019-11-28 | Code |
| 60 | RGMP | 64.8 | No | - | - | Code |
| 61 | OSVOS-S | 64.7 | No | Video Object Segmentation Without Temporal Infor... | 2017-09-18 | - |
| 62 | AGSS-VOS | 63.4 | No | - | - | Code |
| 63 | MAST | 63.3 | No | MAST: A Memory-Augmented Self-supervised Tracker | 2020-02-18 | Code |
| 64 | RANet | 63.2 | No | RANet: Ranking Attention Network for Fast Video ... | 2019-08-19 | Code |
| 65 | OnAVOS | 61.6 | No | Online Adaptation of Convolutional Neural Networ... | 2017-06-28 | - |
| 66 | Spatiotemporal CNN | 58.7 | No | Spatiotemporal CNN for Video Object Segmentation | 2019-04-04 | Code |
| 67 | VOSwL (Language) | 58 | No | Video Object Segmentation with Language Referrin... | 2018-03-21 | - |
| 68 | UVC | 57.7 | No | Joint-task Self-supervised Learning for Temporal... | 2019-09-26 | Code |
| 69 | RVOS | 57.5 | No | RVOS: End-to-End Recurrent Network for Video Obj... | 2019-03-13 | Code |
| 70 | OSVOS | 56.6 | No | One-Shot Video Object Segmentation | 2016-11-16 | Code |
| 71 | VideoMatch | 56.5 | No | VideoMatch: Matching based Video Object Segmenta... | 2018-09-04 | - |
| 72 | FAVOS | 54.6 | No | Fast and Accurate Online Video Object Segmentati... | 2018-06-06 | Code |
| 73 | SiamMask | 54.3 | No | Fast Online Object Tracking and Segmentation: A ... | 2018-12-12 | Code |
| 74 | MuG-W | 54.1 | No | Learning Video Object Segmentation from Unlabele... | 2020-03-10 | Code |
| 75 | OSMN | 52.5 | No | Efficient Video Object Segmentation via Network ... | 2018-02-04 | Code |
| 76 | CorrFlow | 48.4 | No | Self-supervised Learning for Video Correspondenc... | 2019-05-02 | Code |
| 77 | CycleTime | 46.4 | No | Learning Correspondence from the Cycle-Consisten... | 2019-03-18 | Code |