TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Proposal, Tracking and Segmentation (PTS): A Cascaded Netw...

Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object Segmentation

Qiang Zhou, Zilong Huang, Lichao Huang, Yongchao Gong, Han Shen, Chang Huang, Wenyu Liu, Xinggang Wang

2019-07-02Semi-Supervised Video Object SegmentationOne-shot visual object segmentationSegmentationSemantic SegmentationVideo Object SegmentationObject TrackingVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

Video object segmentation (VOS) aims at pixel-level object tracking given only the annotations in the first frame. Due to the large visual variations of objects in video and the lack of training samples, it remains a difficult task despite the upsurging development of deep learning. Toward solving the VOS problem, we bring in several new insights by the proposed unified framework consisting of object proposal, tracking and segmentation components. The object proposal network transfers objectness information as generic knowledge into VOS; the tracking network identifies the target object from the proposals; and the segmentation network is performed based on the tracking results with a novel dynamic-reference based model adaptation scheme. Extensive experiments have been conducted on the DAVIS'17 dataset and the YouTube-VOS dataset, our method achieves the state-of-the-art performance on several video object segmentation benchmarks. We make the code publicly available at https://github.com/sydney0zq/PTSNet.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)77.7PTSNet
VideoDAVIS 2017 (val)J&F74.65PTSNet
VideoDAVIS 2017 (val)Jaccard (Mean)71.6PTSNet
Object TrackingYouTube-VOS 2018Jaccard (Seen)73.5PTSNet
Object TrackingYouTube-VOS 2018Jaccard (Unseen)64.3PTSNet
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)77.7PTSNet
Video Object SegmentationDAVIS 2017 (val)J&F74.65PTSNet
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)71.6PTSNet
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)77.7PTSNet
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F74.65PTSNet
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)71.6PTSNet
Visual Object TrackingYouTube-VOS 2018Jaccard (Seen)73.5PTSNet
Visual Object TrackingYouTube-VOS 2018Jaccard (Unseen)64.3PTSNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17