TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Efficient Video Object Segmentation via Network Modulation

Efficient Video Object Segmentation via Network Modulation

Linjie Yang, Yanran Wang, Xuehan Xiong, Jianchao Yang, Aggelos K. Katsaggelos

2018-02-04CVPR 2018 6Visual Object TrackingSemi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic SegmentationVideo Instance Segmentation
PaperPDFCode(official)

Abstract

Video object segmentation targets at segmenting a specific object throughout a video sequence, given only an annotated first frame. Recent deep learning based approaches find it effective by fine-tuning a general-purpose segmentation model on the annotated frame using hundreds of iterations of gradient descent. Despite the high accuracy these methods achieve, the fine-tuning process is inefficient and fail to meet the requirements of real world applications. We propose a novel approach that uses a single forward pass to adapt the segmentation model to the appearance of a specific object. Specifically, a second meta neural network named modulator is learned to manipulate the intermediate layers of the segmentation network given limited visual and spatial information of the target object. The experiments show that our approach is 70times faster than fine-tuning approaches while achieving similar accuracy.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Decay)24.3OSMN
VideoDAVIS 2017 (val)F-measure (Mean)57.1OSMN
VideoDAVIS 2017 (val)F-measure (Recall)66.1OSMN
VideoDAVIS 2017 (val)J&F54.8OSMN
VideoDAVIS 2017 (val)Jaccard (Decay)21.5OSMN
VideoDAVIS 2017 (val)Jaccard (Mean)52.5OSMN
VideoDAVIS 2017 (val)Jaccard (Recall)60.9OSMN
VideoDAVIS 2016F-measure (Decay)10.6OSMN
VideoDAVIS 2016F-measure (Mean)72.9OSMN
VideoDAVIS 2016F-measure (Recall)84OSMN
VideoDAVIS 2016J&F73.45OSMN
VideoDAVIS 2016Jaccard (Decay)9OSMN
VideoDAVIS 2016Jaccard (Mean)74OSMN
VideoDAVIS 2016Jaccard (Recall)87.6OSMN
VideoDAVIS 2017 (test-dev)F-measure (Decay)17.4OSMN
VideoDAVIS 2017 (test-dev)F-measure (Recall)47.4OSMN
VideoDAVIS 2017 (test-dev)J&F41.3OSMN
VideoDAVIS 2017 (test-dev)Jaccard (Decay)19OSMN
VideoDAVIS 2017 (test-dev)Jaccard (Mean)37.7OSMN
VideoDAVIS 2017 (test-dev)Jaccard (Recall)38.9OSMN
VideoYouTube-VOS 2018F-Measure (Seen)60.1OSMN
VideoYouTube-VOS 2018F-Measure (Unseen)44OSMN
VideoYouTube-VOS 2018Jaccard (Seen)60OSMN
VideoYouTube-VOS 2018Jaccard (Unseen)40.6OSMN
VideoYouTube-VOS 2018Overall51.2OSMN
VideoYouTube-VOS 2018Speed (FPS)7.14OSMN
VideoYouTube-VOS 2018Jaccard (Seen)60OSMN
Object TrackingYouTube-VOS 2018F-Measure (Seen)60.1OSMN
Object TrackingYouTube-VOS 2018F-Measure (Unseen)44OSMN
Object TrackingYouTube-VOS 2018Jaccard (Seen)60OSMN
Object TrackingYouTube-VOS 2018O (Average of Measures)51.2OSMN
Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)24.3OSMN
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)57.1OSMN
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)66.1OSMN
Video Object SegmentationDAVIS 2017 (val)J&F54.8OSMN
Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)21.5OSMN
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)52.5OSMN
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)60.9OSMN
Video Object SegmentationDAVIS 2016F-measure (Decay)10.6OSMN
Video Object SegmentationDAVIS 2016F-measure (Mean)72.9OSMN
Video Object SegmentationDAVIS 2016F-measure (Recall)84OSMN
Video Object SegmentationDAVIS 2016J&F73.45OSMN
Video Object SegmentationDAVIS 2016Jaccard (Decay)9OSMN
Video Object SegmentationDAVIS 2016Jaccard (Mean)74OSMN
Video Object SegmentationDAVIS 2016Jaccard (Recall)87.6OSMN
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)17.4OSMN
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)47.4OSMN
Video Object SegmentationDAVIS 2017 (test-dev)J&F41.3OSMN
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)19OSMN
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)37.7OSMN
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)38.9OSMN
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)60.1OSMN
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)44OSMN
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)60OSMN
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)40.6OSMN
Video Object SegmentationYouTube-VOS 2018Overall51.2OSMN
Video Object SegmentationYouTube-VOS 2018Speed (FPS)7.14OSMN
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)60OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)24.3OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)57.1OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)66.1OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F54.8OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)21.5OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)52.5OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)60.9OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Decay)10.6OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)72.9OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Recall)84OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016J&F73.45OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Decay)9OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)74OSMN
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Recall)87.6OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)17.4OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)47.4OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F41.3OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)19OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)37.7OSMN
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)38.9OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)60.1OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)44OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)60OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)40.6OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall51.2OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Speed (FPS)7.14OSMN
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)60OSMN
Video Instance SegmentationYouTube-VIS validationAP5028.6OSMN
Video Instance SegmentationYouTube-VIS validationAP7533.1OSMN
Video Instance SegmentationYouTube-VIS validationmask AP29.1OSMN
Visual Object TrackingYouTube-VOS 2018F-Measure (Seen)60.1OSMN
Visual Object TrackingYouTube-VOS 2018F-Measure (Unseen)44OSMN
Visual Object TrackingYouTube-VOS 2018Jaccard (Seen)60OSMN
Visual Object TrackingYouTube-VOS 2018O (Average of Measures)51.2OSMN

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17