TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dense Unsupervised Learning for Video Segmentation

Dense Unsupervised Learning for Video Segmentation

Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

2021-11-11NeurIPS 2021 12Unsupervised Video Object SegmentationSemi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train such a model results in a degenerate solution. We propose to prevent this with a simple regularisation scheme, accommodating the equivariance property of the segmentation task to similarity transformations. Our training objective admits efficient implementation and exhibits fast training convergence. On established VOS benchmarks, our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)71.7Araslanov et al.
VideoDAVIS 2017 (val)F-measure (Recall)84.8Araslanov et al.
VideoDAVIS 2017 (val)J&F69.4Araslanov et al.
VideoDAVIS 2017 (val)Jaccard (Mean)67.1Araslanov et al.
VideoDAVIS 2017 (val)Jaccard (Recall)80.9Araslanov et al.
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.7Araslanov et al.
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)84.8Araslanov et al.
Video Object SegmentationDAVIS 2017 (val)J&F69.4Araslanov et al.
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)67.1Araslanov et al.
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)80.9Araslanov et al.
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)71.7Araslanov et al.
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)84.8Araslanov et al.
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F69.4Araslanov et al.
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)67.1Araslanov et al.
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)80.9Araslanov et al.

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17