TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bootstrapping Objectness from Videos by Relaxed Common Fat...

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

Long Lian, Zhirong Wu, Stella X. Yu

2023-04-17CVPR 2023 1Unsupervised Video Object SegmentationOptical Flow EstimationMotion SegmentationSegmentationSemantic SegmentationObject DiscoveryVideo Object SegmentationVideo Semantic SegmentationUnsupervised Object Segmentation
PaperPDFCode(official)

Abstract

We study learning object segmentation from unlabeled videos. Humans can easily segment moving objects without knowing what they are. The Gestalt law of common fate, i.e., what move at the same speed belong together, has inspired unsupervised object discovery based on motion segmentation. However, common fate is not a reliable indicator of objectness: Parts of an articulated / deformable object may not move at the same speed, whereas shadows / reflections of an object always move with it but are not part of it. Our insight is to bootstrap objectness by first learning image features from relaxed common fate and then refining them based on visual appearance grouping within the image itself and across images statistically. Specifically, we learn an image segmenter first in the loop of approximating optical flow with constant segment flow plus small within-segment residual flow, and then by refining it for more coherent appearance and statistical figure-ground relevance. On unsupervised video object segmentation, using only ResNet and convolutional heads, our model surpasses the state-of-the-art by absolute gains of 7/9/5% on DAVIS16 / STv2 / FBMS59 respectively, demonstrating the effectiveness of our ideas. Our code is publicly available.

Results

TaskDatasetMetricValueModel
Instance SegmentationSegTrack-v2mIoU79.6RCF (with post-processing)
Instance SegmentationSegTrack-v2mIoU76.7RCF (without post-processing)
Instance SegmentationFBMS-59mIoU72.4RCF (with post-processing)
Instance SegmentationFBMS-59mIoU69.9RCF (without post-processing)
Instance SegmentationDAVIS 2016J score83RCF (with Post-Processing)
Instance SegmentationDAVIS 2016J score80.9RCF (without Post-Processing)
Unsupervised Object SegmentationSegTrack-v2mIoU79.6RCF (with post-processing)
Unsupervised Object SegmentationSegTrack-v2mIoU76.7RCF (without post-processing)
Unsupervised Object SegmentationFBMS-59mIoU72.4RCF (with post-processing)
Unsupervised Object SegmentationFBMS-59mIoU69.9RCF (without post-processing)
Unsupervised Object SegmentationDAVIS 2016J score83RCF (with Post-Processing)
Unsupervised Object SegmentationDAVIS 2016J score80.9RCF (without Post-Processing)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17