Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Qiang Zhou, Zilong Huang, Lichao Huang, Yongchao Gong, Han Shen, Chang Huang, Wenyu Liu, Xinggang Wang

2019-07-02

Tasks: Semi-Supervised Video Object Segmentation, One-shot visual object segmentation, Segmentation, Semantic Segmentation, Video Object Segmentation, Object Tracking, Video Semantic Segmentation

Abstract

Video object segmentation (VOS) aims at pixel-level object tracking given only the annotations in the first frame. Due to the large visual variations of objects in video and the lack of training samples, it remains a difficult task despite the rapid development of deep learning. Toward solving the VOS problem, we bring in several new insights through a proposed unified framework consisting of object proposal, tracking and segmentation components. The object proposal network transfers objectness information as generic knowledge into VOS; the tracking network identifies the target object from the proposals; and the segmentation network operates on the tracking results with a novel dynamic-reference based model adaptation scheme. Extensive experiments on the DAVIS'17 and YouTube-VOS datasets show that our method achieves state-of-the-art performance on several video object segmentation benchmarks. We make the code publicly available at https://github.com/sydney0zq/PTSNet.
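
The control flow of such a proposal, tracking and segmentation cascade can be sketched with toy stand-ins. This is only an illustration of the data flow described in the abstract, not the PTSNet implementation: the function names (`select_proposal`, `segment_in_box`, `run_cascade`) are hypothetical, proposals are supplied by hand instead of coming from a proposal network, the "tracker" simply picks the proposal with the highest box overlap with the previous target box, and the "segmenter" is a pixel threshold inside the tracked box. The dynamic-reference model adaptation scheme is not modeled at all.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1); x1 and y1 are exclusive
Frame = List[List[int]]          # toy grayscale frame as rows of pixel values


def box_iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    inter_w = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0


def select_proposal(proposals: List[Box], previous_box: Box) -> Box:
    """Stand-in for the tracking stage: keep the proposal that overlaps
    the previous target box the most (an assumption, not the paper's tracker)."""
    return max(proposals, key=lambda p: box_iou(p, previous_box))


def segment_in_box(frame: Frame, box: Box, threshold: int = 1) -> Frame:
    """Stand-in for the segmentation stage: mark pixels inside the tracked
    box whose value reaches the threshold; everything else is background."""
    x0, y0, x1, y1 = box
    return [
        [1 if (x0 <= x < x1 and y0 <= y < y1 and v >= threshold) else 0
         for x, v in enumerate(row)]
        for y, row in enumerate(frame)
    ]


def run_cascade(frames: List[Frame],
                proposals_per_frame: List[List[Box]],
                first_box: Box) -> List[Frame]:
    """Propose -> track -> segment, frame by frame, starting from the
    target box derived from the first-frame annotation."""
    box = first_box
    masks = []
    for frame, proposals in zip(frames, proposals_per_frame):
        box = select_proposal(proposals, box)      # tracking picks the target
        masks.append(segment_in_box(frame, box))   # segmentation uses that box
    return masks
```

The point of the sketch is the ordering: segmentation only ever sees the region chosen by tracking, and tracking only ever chooses among proposals, so a distractor outside the tracked box cannot leak into the mask.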

Results

Task	Dataset	Metric	Value	Model
Video	DAVIS 2017 (val)	F-measure (Mean)	77.7	PTSNet
Video	DAVIS 2017 (val)	J&F	74.65	PTSNet
Video	DAVIS 2017 (val)	Jaccard (Mean)	71.6	PTSNet
Object Tracking	YouTube-VOS 2018	Jaccard (Seen)	73.5	PTSNet
Object Tracking	YouTube-VOS 2018	Jaccard (Unseen)	64.3	PTSNet
Video Object Segmentation	DAVIS 2017 (val)	F-measure (Mean)	77.7	PTSNet
Video Object Segmentation	DAVIS 2017 (val)	J&F	74.65	PTSNet
Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Mean)	71.6	PTSNet
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	F-measure (Mean)	77.7	PTSNet
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	J&F	74.65	PTSNet
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Mean)	71.6	PTSNet
Visual Object Tracking	YouTube-VOS 2018	Jaccard (Seen)	73.5	PTSNet
Visual Object Tracking	YouTube-VOS 2018	Jaccard (Unseen)	64.3	PTSNet
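
The J&F value in the table is consistent with the usual DAVIS 2017 convention, in which J&F is the average of the mean Jaccard (region similarity) and the mean F-measure (contour accuracy). The short sketch below checks that arithmetic against the table and shows a per-frame Jaccard on binary masks. The function names are hypothetical, and returning 1.0 when both masks are empty is an assumption of this sketch rather than something stated on this page.

```python
from typing import List

Mask = List[List[int]]  # binary mask as rows of 0/1 values


def jaccard(pred: Mask, gt: Mask) -> float:
    """Region similarity J: intersection-over-union of two binary masks."""
    inter = sum(p & g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    union = sum(p | g for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    # Assumption: two empty masks count as a perfect match.
    return inter / union if union else 1.0


def j_and_f(j_mean: float, f_mean: float) -> float:
    """Overall score: arithmetic mean of J mean and F mean."""
    return (j_mean + f_mean) / 2


# Values from the DAVIS 2017 (val) rows of the table above.
score = j_and_f(71.6, 77.7)
print(f"J&F = {score:.2f}")  # matches the reported 74.65
```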
