TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Transductive Approach for Video Object Segmentation

A Transductive Approach for Video Object Segmentation

Yizhuo Zhang, Zhirong Wu, Houwen Peng, Stephen Lin

2020-04-15CVPR 2020 6Semi-Supervised Video Object SegmentationOptical Flow EstimationSemantic SegmentationVideo Object SegmentationInstance SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

Semi-supervised video object segmentation aims to separate a target object from a video sequence, given the mask in the first frame. Most of current prevailing methods utilize information from additional modules trained in other domains like optical flow and instance segmentation, and as a result they do not compete with other methods on common ground. To address this issue, we propose a simple yet strong transductive method, in which additional modules, datasets, and dedicated architectural designs are not needed. Our method takes a label propagation approach where pixel labels are passed forward based on feature similarity in an embedding space. Different from other propagation methods, ours diffuses temporal information in a holistic manner which take accounts of long-term object appearance. In addition, our method requires few additional computational overhead, and runs at a fast $\sim$37 fps speed. Our single model with a vanilla ResNet50 backbone achieves an overall score of 72.3 on the DAVIS 2017 validation set and 63.1 on the test set. This simple yet high performing and efficient method can serve as a solid baseline that facilitates future research. Code and models are available at \url{https://github.com/microsoft/transductive-vos.pytorch}.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)74.7TVOS
VideoDAVIS 2017 (val)J&F72.3TVOS
VideoDAVIS 2017 (val)Jaccard (Mean)69.9TVOS
VideoDAVIS (no YouTube-VOS training)D17 test (F)67.4TVOS
VideoDAVIS (no YouTube-VOS training)D17 test (G)63.1TVOS
VideoDAVIS (no YouTube-VOS training)D17 test (J)58.8TVOS
VideoDAVIS (no YouTube-VOS training)D17 val (F)74.7TVOS
VideoDAVIS (no YouTube-VOS training)D17 val (G)72.3TVOS
VideoDAVIS (no YouTube-VOS training)D17 val (J)69.9TVOS
VideoDAVIS (no YouTube-VOS training)FPS37TVOS
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)74.7TVOS
Video Object SegmentationDAVIS 2017 (val)J&F72.3TVOS
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)69.9TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (F)67.4TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (G)63.1TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (J)58.8TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)74.7TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)72.3TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)69.9TVOS
Video Object SegmentationDAVIS (no YouTube-VOS training)FPS37TVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)74.7TVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F72.3TVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)69.9TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (F)67.4TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (G)63.1TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (J)58.8TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)74.7TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)72.3TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)69.9TVOS
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)FPS37TVOS

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15