TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Quality-aware Dynamic Memory for Video Object Seg...

Learning Quality-aware Dynamic Memory for Video Object Segmentation

Yong liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang

2022-07-16Semi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

Recently, several spatial-temporal memory-based methods have verified that storing intermediate frames and their masks as memory are helpful to segment target objects in videos. However, they mainly focus on better matching between the current frame and the memory frames without explicitly paying attention to the quality of the memory. Therefore, frames with poor segmentation masks are prone to be memorized, which leads to a segmentation mask error accumulation problem and further affect the segmentation performance. In addition, the linear increase of memory frames with the growth of frame number also limits the ability of the models to handle long videos. To this end, we propose a Quality-aware Dynamic Memory Network (QDMN) to evaluate the segmentation quality of each frame, allowing the memory bank to selectively store accurately segmented frames to prevent the error accumulation problem. Then, we combine the segmentation quality with temporal consistency to dynamically update the memory bank to improve the practicability of the models. Without any bells and whistles, our QDMN achieves new state-of-the-art performance on both DAVIS and YouTube-VOS benchmarks. Moreover, extensive experiments demonstrate that the proposed Quality Assessment Module (QAM) can be applied to memory-based methods as generic plugins and significantly improves performance. Our source code is available at https://github.com/workforai/QDMN.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)88.6QDMN
VideoDAVIS 2017 (val)J&F85.6QDMN
VideoDAVIS 2017 (val)Jaccard (Mean)82.5QDMN
VideoDAVIS 2016F-measure (Mean)93.2QDMN
VideoDAVIS 2016J&F92QDMN
VideoDAVIS 2016Jaccard (Mean)90.7QDMN
VideoDAVIS 2017 (test-dev)F-measure (Mean)85.4QDMN
VideoDAVIS 2017 (test-dev)J&F81.9QDMN
VideoDAVIS 2017 (test-dev)Jaccard (Mean)78.1QDMN
VideoYouTube-VOS 2018F-Measure (Seen)87.5QDMN
VideoYouTube-VOS 2018F-Measure (Unseen)86.4QDMN
VideoYouTube-VOS 2018Jaccard (Seen)82.7QDMN
VideoYouTube-VOS 2018Jaccard (Unseen)78.4QDMN
VideoYouTube-VOS 2018Overall83.8QDMN
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)88.6QDMN
Video Object SegmentationDAVIS 2017 (val)J&F85.6QDMN
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)82.5QDMN
Video Object SegmentationDAVIS 2016F-measure (Mean)93.2QDMN
Video Object SegmentationDAVIS 2016J&F92QDMN
Video Object SegmentationDAVIS 2016Jaccard (Mean)90.7QDMN
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)85.4QDMN
Video Object SegmentationDAVIS 2017 (test-dev)J&F81.9QDMN
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)78.1QDMN
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)87.5QDMN
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)86.4QDMN
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)82.7QDMN
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)78.4QDMN
Video Object SegmentationYouTube-VOS 2018Overall83.8QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)88.6QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F85.6QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)82.5QDMN
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)93.2QDMN
Semi-Supervised Video Object SegmentationDAVIS 2016J&F92QDMN
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)90.7QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)85.4QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F81.9QDMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)78.1QDMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)87.5QDMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)86.4QDMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)82.7QDMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)78.4QDMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall83.8QDMN

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17