TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Supervised Video Object Segmentation by Motion-Aware ...

Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation

Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Ajmal Mian

2021-07-27Semi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

We propose a self-supervised spatio-temporal matching method, coined Motion-Aware Mask Propagation (MAMP), for video object segmentation. MAMP leverages the frame reconstruction task for training without the need for annotations. During inference, MAMP extracts high-resolution features from each frame to build a memory bank from the features as well as the predicted masks of selected past frames. MAMP then propagates the masks from the memory bank to subsequent frames according to our proposed motion-aware spatio-temporal matching module to handle fast motion and long-term matching scenarios. Evaluation on DAVIS-2017 and YouTube-VOS datasets show that MAMP achieves state-of-the-art performance with stronger generalization ability compared to existing self-supervised methods, i.e., 4.2% higher mean J&F on DAVIS-2017 and 4.85% higher mean J&F on the unseen categories of YouTube-VOS than the nearest competitor. Moreover, MAMP performs at par with many supervised video object segmentation methods. Our code is available at: https://github.com/bo-miao/MAMP.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)71.2MAMP
VideoDAVIS 2017 (val)J&F69.7MAMP
VideoDAVIS 2017 (val)Jaccard (Mean)68.3MAMP
VideoYouTube-VOS 2018F-Measure (Seen)68.4MAMP
VideoYouTube-VOS 2018F-Measure (Unseen)73.2MAMP
VideoYouTube-VOS 2018Jaccard (Seen)67MAMP
VideoYouTube-VOS 2018Jaccard (Unseen)64.5MAMP
VideoYouTube-VOS 2018Overall68.2MAMP
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.2MAMP
Video Object SegmentationDAVIS 2017 (val)J&F69.7MAMP
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)68.3MAMP
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)68.4MAMP
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)73.2MAMP
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)67MAMP
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)64.5MAMP
Video Object SegmentationYouTube-VOS 2018Overall68.2MAMP
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.2MAMP
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F69.7MAMP
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)68.3MAMP
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)68.4MAMP
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)73.2MAMP
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)67MAMP
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)64.5MAMP
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall68.2MAMP

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17