TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Video Object Segmentation from Static Images

Learning Video Object Segmentation from Static Images

Anna Khoreva, Federico Perazzi, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung

2016-12-08CVPR 2017 7Visual Object TrackingSemi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationObject TrackingInstance SegmentationVideo Semantic Segmentation
PaperPDFCodeCode

Abstract

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce video object segmentation problem as a concept of guided instance segmentation. Our model proceeds on a per-frame basis, guided by the output of the previous frame towards the object of interest in the next frame. We demonstrate that highly accurate object segmentation in videos can be enabled by using a convnet trained with static images only. The key ingredient of our approach is a combination of offline and online learning strategies, where the former serves to produce a refined mask from the previous frame estimate and the latter allows to capture the appearance of the specific object instance. Our method can handle different types of input annotations: bounding boxes and segments, as well as incorporate multiple annotated frames, making the system suitable for diverse applications. We obtain competitive results on three different datasets, independently from the type of input annotation.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2016F-measure (Decay)9MSK
VideoDAVIS 2016F-measure (Mean)75.4MSK
VideoDAVIS 2016F-measure (Recall)87.1MSK
VideoDAVIS 2016J&F77.55MSK
VideoDAVIS 2016Jaccard (Decay)8.9MSK
VideoDAVIS 2016Jaccard (Mean)79.7MSK
VideoDAVIS 2016Jaccard (Recall)93.1MSK
VideoYouTubemIoU0.726MaskTrack
Video Object SegmentationDAVIS 2016F-measure (Decay)9MSK
Video Object SegmentationDAVIS 2016F-measure (Mean)75.4MSK
Video Object SegmentationDAVIS 2016F-measure (Recall)87.1MSK
Video Object SegmentationDAVIS 2016J&F77.55MSK
Video Object SegmentationDAVIS 2016Jaccard (Decay)8.9MSK
Video Object SegmentationDAVIS 2016Jaccard (Mean)79.7MSK
Video Object SegmentationDAVIS 2016Jaccard (Recall)93.1MSK
Video Object SegmentationYouTubemIoU0.726MaskTrack
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Decay)9MSK
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)75.4MSK
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Recall)87.1MSK
Semi-Supervised Video Object SegmentationDAVIS 2016J&F77.55MSK
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Decay)8.9MSK
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)79.7MSK
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Recall)93.1MSK
Semi-Supervised Video Object SegmentationYouTubemIoU0.726MaskTrack

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17