Learning Quality-aware Dynamic Memory for Video Object Segmentation

Yong Liu, Ran Yu, Fei Yin, Xinyuan Zhao, Wei Zhao, Weihao Xia, Yujiu Yang

2022-07-16
Tags: Semi-Supervised Video Object Segmentation, Segmentation, Semantic Segmentation, Video Object Segmentation, Video Semantic Segmentation


Abstract

Recently, several spatial-temporal memory-based methods have verified that storing intermediate frames and their masks as memory helps segment target objects in videos. However, they mainly focus on better matching between the current frame and the memory frames, without explicitly paying attention to the quality of the memory. As a result, frames with poor segmentation masks are prone to being memorized, which causes segmentation mask errors to accumulate and further degrades segmentation performance. In addition, the number of memory frames grows linearly with the number of video frames, which limits the ability of these models to handle long videos. To this end, we propose a Quality-aware Dynamic Memory Network (QDMN) that evaluates the segmentation quality of each frame, allowing the memory bank to selectively store accurately segmented frames and thus prevent error accumulation. We then combine the segmentation quality with temporal consistency to dynamically update the memory bank, improving the practicability of the models. Without any bells and whistles, our QDMN achieves new state-of-the-art performance on both the DAVIS and YouTube-VOS benchmarks. Moreover, extensive experiments demonstrate that the proposed Quality Assessment Module (QAM) can be applied to memory-based methods as a generic plugin and significantly improves their performance. Our source code is available at https://github.com/workforai/QDMN.
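The two ideas in the abstract, admitting only well-segmented frames into memory and bounding the memory by combining quality with temporal consistency, can be illustrated with a minimal sketch. This is not the authors' implementation (see the repository above for that). The class names, the threshold of 0.8, the capacity, the recency formula, the weighting between quality and recency, and the choice to always keep the first frame are all assumptions made for illustration. The quality score is assumed to come from some quality-assessment module and to lie in [0, 1].

```python
from dataclasses import dataclass, field
from typing import Any, List


@dataclass
class MemoryEntry:
    frame_index: int
    features: Any    # placeholder for the stored frame/mask representation
    quality: float   # predicted segmentation quality in [0, 1] (assumed given)


@dataclass
class QualityAwareMemory:
    """Hypothetical bounded memory bank; all hyperparameters are illustrative."""
    capacity: int = 5
    quality_threshold: float = 0.8
    temporal_weight: float = 0.5
    entries: List[MemoryEntry] = field(default_factory=list)

    def _keep_score(self, entry: MemoryEntry, current_index: int) -> float:
        # Assumed temporal term: frames closer to the current one score higher.
        recency = 1.0 / (1.0 + (current_index - entry.frame_index))
        w = self.temporal_weight
        return (1.0 - w) * entry.quality + w * recency

    def update(self, frame_index: int, features: Any, quality: float) -> bool:
        """Try to memorize a frame; return True if it was stored."""
        # Selective storage: skip frames whose mask is judged unreliable.
        if quality < self.quality_threshold:
            return False
        self.entries.append(MemoryEntry(frame_index, features, quality))
        # Dynamic update: when over capacity, evict the lowest-scoring entry.
        # Assumption: entry 0 (the annotated first frame) is never evicted.
        if len(self.entries) > self.capacity:
            candidates = range(1, len(self.entries))
            worst = min(
                candidates,
                key=lambda i: self._keep_score(self.entries[i], frame_index),
            )
            del self.entries[worst]
        return True
```

With a fixed capacity the memory size no longer grows with video length, which is the property the abstract ties to handling long videos. How quality and temporal consistency are actually combined in QDMN is described in the paper, not here.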

Results

Task	Dataset	Metric	Value	Model
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	F-measure (Mean)	88.6	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	J&F	85.6	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Mean)	82.5	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2016	F-measure (Mean)	93.2	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2016	J&F	92.0	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2016	Jaccard (Mean)	90.7	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	F-measure (Mean)	85.4	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	J&F	81.9	QDMN
Semi-Supervised Video Object Segmentation	DAVIS 2017 (test-dev)	Jaccard (Mean)	78.1	QDMN
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	F-Measure (Seen)	87.5	QDMN
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	F-Measure (Unseen)	86.4	QDMN
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	Jaccard (Seen)	82.7	QDMN
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	Jaccard (Unseen)	78.4	QDMN
Semi-Supervised Video Object Segmentation	YouTube-VOS 2018	Overall	83.8	QDMN
