TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Object Segmentation Without Temporal Information

Video Object Segmentation Without Temporal Information

Kevis-Kokitsi Maninis, Sergi Caelles, Yu-Hua Chen, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc van Gool

2017-09-18Foreground SegmentationSemi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDF

Abstract

Video Object Segmentation, and video processing in general, has been historically dominated by methods that rely on the temporal consistency and redundancy in consecutive video frames. When the temporal smoothness is suddenly broken, such as when an object is occluded, or some frames are missing in a sequence, the result of these methods can deteriorate significantly or they may not even produce any result at all. This paper explores the orthogonal approach of processing each frame independently, i.e disregarding the temporal information. In particular, it tackles the task of semi-supervised video object segmentation: the separation of an object from the background in a video, given its mask in the first frame. We present Semantic One-Shot Video Object Segmentation (OSVOS-S), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one shot). We show that instance level semantic information, when combined effectively, can dramatically improve the results of our previous method, OSVOS. We perform experiments on two recent video segmentation databases, which show that OSVOS-S is both the fastest and most accurate method in the state of the art.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Decay)18.5OSVOS-S
VideoDAVIS 2017 (val)F-measure (Mean)71.3OSVOS-S
VideoDAVIS 2017 (val)F-measure (Recall)80.7OSVOS-S
VideoDAVIS 2017 (val)J&F68OSVOS-S
VideoDAVIS 2017 (val)Jaccard (Decay)15.1OSVOS-S
VideoDAVIS 2017 (val)Jaccard (Mean)64.7OSVOS-S
VideoDAVIS 2017 (val)Jaccard (Recall)74.2OSVOS-S
VideoDAVIS 2016F-measure (Decay)8.2OSVOS-S
VideoDAVIS 2016F-measure (Mean)87.5OSVOS-S
VideoDAVIS 2016F-measure (Recall)95.9OSVOS-S
VideoDAVIS 2016J&F86.55OSVOS-S
VideoDAVIS 2016Jaccard (Decay)5.5OSVOS-S
VideoDAVIS 2016Jaccard (Mean)85.6OSVOS-S
VideoDAVIS 2016Jaccard (Recall)96.8OSVOS-S
VideoDAVIS 2017 (test-dev)F-measure (Decay)21.9OSVOS-S
VideoDAVIS 2017 (test-dev)F-measure (Mean)62.1OSVOS-S
VideoDAVIS 2017 (test-dev)F-measure (Recall)70.5OSVOS-S
VideoDAVIS 2017 (test-dev)J&F57.5OSVOS-S
VideoDAVIS 2017 (test-dev)Jaccard (Decay)24.1OSVOS-S
VideoDAVIS 2017 (test-dev)Jaccard (Mean)52.9OSVOS-S
VideoDAVIS 2017 (test-dev)Jaccard (Recall)60.2OSVOS-S
Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)18.5OSVOS-S
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.3OSVOS-S
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)80.7OSVOS-S
Video Object SegmentationDAVIS 2017 (val)J&F68OSVOS-S
Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)15.1OSVOS-S
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)64.7OSVOS-S
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)74.2OSVOS-S
Video Object SegmentationDAVIS 2016F-measure (Decay)8.2OSVOS-S
Video Object SegmentationDAVIS 2016F-measure (Mean)87.5OSVOS-S
Video Object SegmentationDAVIS 2016F-measure (Recall)95.9OSVOS-S
Video Object SegmentationDAVIS 2016J&F86.55OSVOS-S
Video Object SegmentationDAVIS 2016Jaccard (Decay)5.5OSVOS-S
Video Object SegmentationDAVIS 2016Jaccard (Mean)85.6OSVOS-S
Video Object SegmentationDAVIS 2016Jaccard (Recall)96.8OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)21.9OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)62.1OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)70.5OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)J&F57.5OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)24.1OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)52.9OSVOS-S
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)60.2OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)18.5OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.3OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)80.7OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F68OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)15.1OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)64.7OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)74.2OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Decay)8.2OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)87.5OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Recall)95.9OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016J&F86.55OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Decay)5.5OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)85.6OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Recall)96.8OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)21.9OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)62.1OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)70.5OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F57.5OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)24.1OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)52.9OSVOS-S
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)60.2OSVOS-S

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17