TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Object Segmentation with Adaptive Feature Bank and U...

Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement

Yongqing Liang, Xin Li, Navid Jafari, Qin Chen

2020-10-15NeurIPS 2020 12Semi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

We propose a new matching-based framework for semi-supervised video object segmentation (VOS). Recently, state-of-the-art VOS performance has been achieved by matching-based algorithms, in which feature banks are created to store features for region matching and classification. However, how to effectively organize information in the continuously growing feature bank remains under-explored, and this leads to inefficient design of the bank. We introduce an adaptive feature bank update scheme to dynamically absorb new features and discard obsolete features. We also design a new confidence loss and a fine-grained segmentation module to enhance the segmentation accuracy in uncertain regions. On public benchmarks, our algorithm outperforms existing state-of-the-arts.

Results

TaskDatasetMetricValueModel
VideoYouTube-VOS 2018F-Measure (Seen)83.1AFB-URR
VideoYouTube-VOS 2018F-Measure (Unseen)82.6AFB-URR
VideoYouTube-VOS 2018Jaccard (Seen)78.8AFB-URR
VideoYouTube-VOS 2018Jaccard (Unseen)74.1AFB-URR
VideoYouTube-VOS 2018Mean Jaccard & F-Measure79.6AFB-URR
VideoDAVIS 2017 (val)F-measure76.1AFB-URR
VideoDAVIS 2017 (val)Jaccard73AFB-URR
VideoDAVIS 2017 (val)Mean Jaccard & F-Measure74.6AFB-URR
VideoDAVIS 2017 (val)F-measure (Decay)15.5AFB-URR
VideoDAVIS 2017 (val)F-measure (Mean)76.1AFB-URR
VideoDAVIS 2017 (val)F-measure (Recall)87AFB-URR
VideoDAVIS 2017 (val)J&F74.6AFB-URR
VideoDAVIS 2017 (val)Jaccard (Decay)13.8AFB-URR
VideoDAVIS 2017 (val)Jaccard (Mean)73AFB-URR
VideoDAVIS 2017 (val)Jaccard (Recall)85.3AFB-URR
VideoLong Video Dataset (3X)F84.6AFB-URR
VideoLong Video Dataset (3X)J82.9AFB-URR
VideoLong Video Dataset (3X)J&F83.8AFB-URR
VideoLong Video DatasetF84.5AFB-URR
VideoLong Video DatasetJ82.9AFB-URR
VideoLong Video DatasetJ&F83.7AFB-URR
VideoDAVIS (no YouTube-VOS training)D17 val (F)76.1AFB-URR
VideoDAVIS (no YouTube-VOS training)D17 val (G)74.6AFB-URR
VideoDAVIS (no YouTube-VOS training)D17 val (J)73AFB-URR
VideoDAVIS (no YouTube-VOS training)FPS4AFB-URR
VideoYouTube-VOS 2018F-Measure (Seen)83.1AFB-URR
VideoYouTube-VOS 2018F-Measure (Unseen)82.6AFB-URR
VideoYouTube-VOS 2018Jaccard (Seen)78.8AFB-URR
VideoYouTube-VOS 2018Jaccard (Unseen)74.1AFB-URR
VideoYouTube-VOS 2018Overall79.6AFB-URR
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)83.1AFB-URR
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)82.6AFB-URR
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)78.8AFB-URR
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)74.1AFB-URR
Video Object SegmentationYouTube-VOS 2018Mean Jaccard & F-Measure79.6AFB-URR
Video Object SegmentationDAVIS 2017 (val)F-measure76.1AFB-URR
Video Object SegmentationDAVIS 2017 (val)Jaccard73AFB-URR
Video Object SegmentationDAVIS 2017 (val)Mean Jaccard & F-Measure74.6AFB-URR
Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)15.5AFB-URR
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)76.1AFB-URR
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)87AFB-URR
Video Object SegmentationDAVIS 2017 (val)J&F74.6AFB-URR
Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)13.8AFB-URR
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)73AFB-URR
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)85.3AFB-URR
Video Object SegmentationLong Video Dataset (3X)F84.6AFB-URR
Video Object SegmentationLong Video Dataset (3X)J82.9AFB-URR
Video Object SegmentationLong Video Dataset (3X)J&F83.8AFB-URR
Video Object SegmentationLong Video DatasetF84.5AFB-URR
Video Object SegmentationLong Video DatasetJ82.9AFB-URR
Video Object SegmentationLong Video DatasetJ&F83.7AFB-URR
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)76.1AFB-URR
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)74.6AFB-URR
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)73AFB-URR
Video Object SegmentationDAVIS (no YouTube-VOS training)FPS4AFB-URR
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)83.1AFB-URR
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)82.6AFB-URR
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)78.8AFB-URR
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)74.1AFB-URR
Video Object SegmentationYouTube-VOS 2018Overall79.6AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)15.5AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)76.1AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)87AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F74.6AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)13.8AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)73AFB-URR
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)85.3AFB-URR
Semi-Supervised Video Object SegmentationLong Video Dataset (3X)F84.6AFB-URR
Semi-Supervised Video Object SegmentationLong Video Dataset (3X)J82.9AFB-URR
Semi-Supervised Video Object SegmentationLong Video Dataset (3X)J&F83.8AFB-URR
Semi-Supervised Video Object SegmentationLong Video DatasetF84.5AFB-URR
Semi-Supervised Video Object SegmentationLong Video DatasetJ82.9AFB-URR
Semi-Supervised Video Object SegmentationLong Video DatasetJ&F83.7AFB-URR
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)76.1AFB-URR
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)74.6AFB-URR
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)73AFB-URR
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)FPS4AFB-URR
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)83.1AFB-URR
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)82.6AFB-URR
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)78.8AFB-URR
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)74.1AFB-URR
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall79.6AFB-URR

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17