TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Collaborative Video Object Segmentation by Multi-Scale For...

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

Zongxin Yang, Yunchao Wei, Yi Yang

2020-10-13Semi-Supervised Video Object SegmentationOne-shot visual object segmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

This paper investigates the principles of embedding learning to tackle the challenging semi-supervised video object segmentation. Unlike previous practices that focus on exploring the embedding learning of foreground object (s), we consider background should be equally treated. Thus, we propose a Collaborative video object segmentation by Foreground-Background Integration (CFBI) approach. CFBI separates the feature embedding into the foreground object region and its corresponding background region, implicitly promoting them to be more contrastive and improving the segmentation results accordingly. Moreover, CFBI performs both pixel-level matching processes and instance-level attention mechanisms between the reference and the predicted sequence, making CFBI robust to various object scales. Based on CFBI, we introduce a multi-scale matching structure and propose an Atrous Matching strategy, resulting in a more robust and efficient framework, CFBI+. We conduct extensive experiments on two popular benchmarks, i.e., DAVIS and YouTube-VOS. Without applying any simulated data for pre-training, our CFBI+ achieves the performance (J&F) of 82.9% and 82.8%, outperforming all the other state-of-the-art methods. Code: https://github.com/z-x-yang/CFBI.

Results

TaskDatasetMetricValueModel
VideoYouTube-VOS 2019Jaccard (Unseen)77.1CFBI+
VideoYouTube-VOS 2018F-Measure (Seen)86.6CFBI+
VideoYouTube-VOS 2018F-Measure (Unseen)85.6CFBI+
VideoYouTube-VOS 2018Jaccard (Seen)81.8CFBI+
VideoYouTube-VOS 2018Jaccard (Unseen)77.1CFBI+
VideoYouTube-VOS 2018Mean Jaccard & F-Measure82.8CFBI+
VideoDAVIS 2017 (val)F-measure85.7CFBI+
VideoDAVIS 2017 (val)Jaccard80.1CFBI+
VideoDAVIS 2017 (val)Mean Jaccard & F-Measure82.9CFBI+
VideoDAVIS 2017 (val)F-measure (Mean)85.7CFBI+
VideoDAVIS 2017 (val)J&F82.9CFBI+
VideoDAVIS 2017 (val)Jaccard (Mean)80.1CFBI+
VideoDAVIS 2016F-measure (Mean)91.1CFBI+
VideoDAVIS 2016J&F89.9CFBI+
VideoDAVIS 2016Jaccard (Mean)88.7CFBI+
VideoDAVIS 2017 (test-dev)F-measure (Mean)81.6CFBI+
VideoDAVIS 2017 (test-dev)J&F78CFBI+
VideoDAVIS 2017 (test-dev)Jaccard (Mean)74.4CFBI+
VideoYouTube-VOS 2018F-Measure (Seen)86.6CFBI+
VideoYouTube-VOS 2018F-Measure (Unseen)85.6CFBI+
VideoYouTube-VOS 2018Jaccard (Seen)81.8CFBI+
VideoYouTube-VOS 2018Jaccard (Unseen)77.1CFBI+
VideoYouTube-VOS 2018Overall82.8CFBI+
VideoYouTube-VOS 2018Speed (FPS)4CFBI+
Video Object SegmentationYouTube-VOS 2019Jaccard (Unseen)77.1CFBI+
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)86.6CFBI+
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)85.6CFBI+
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)81.8CFBI+
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)77.1CFBI+
Video Object SegmentationYouTube-VOS 2018Mean Jaccard & F-Measure82.8CFBI+
Video Object SegmentationDAVIS 2017 (val)F-measure85.7CFBI+
Video Object SegmentationDAVIS 2017 (val)Jaccard80.1CFBI+
Video Object SegmentationDAVIS 2017 (val)Mean Jaccard & F-Measure82.9CFBI+
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)85.7CFBI+
Video Object SegmentationDAVIS 2017 (val)J&F82.9CFBI+
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)80.1CFBI+
Video Object SegmentationDAVIS 2016F-measure (Mean)91.1CFBI+
Video Object SegmentationDAVIS 2016J&F89.9CFBI+
Video Object SegmentationDAVIS 2016Jaccard (Mean)88.7CFBI+
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)81.6CFBI+
Video Object SegmentationDAVIS 2017 (test-dev)J&F78CFBI+
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)74.4CFBI+
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)86.6CFBI+
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)85.6CFBI+
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)81.8CFBI+
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)77.1CFBI+
Video Object SegmentationYouTube-VOS 2018Overall82.8CFBI+
Video Object SegmentationYouTube-VOS 2018Speed (FPS)4CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)85.7CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F82.9CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)80.1CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)91.1CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2016J&F89.9CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)88.7CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)81.6CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F78CFBI+
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)74.4CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)86.6CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)85.6CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)81.8CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)77.1CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall82.8CFBI+
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Speed (FPS)4CFBI+

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17