
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos

Runtao Liu, Zhirong Wu, Stella X. Yu, Stephen Lin

Published: 2021-11-11
Venue: NeurIPS 2021 12
Tasks: Video Polyp Segmentation, Zero Shot Segmentation, Segmentation, Semantic Segmentation, Contrastive Learning, Test-time Adaptation, Unsupervised Object Segmentation, Image Segmentation

Abstract

Humans can easily segment moving objects without knowing what they are. That objectness could emerge from continuous visual observations motivates us to model grouping and movement concurrently from unlabeled videos. Our premise is that a video has different views of the same scene related by moving components, and the right region segmentation and region flow would allow mutual view synthesis which can be checked from the data itself without any external supervision. Our model starts with two separate pathways: an appearance pathway that outputs feature-based region segmentation for a single image, and a motion pathway that outputs motion features for a pair of images. It then binds them in a conjoint representation called segment flow that pools flow offsets over each region and provides a gross characterization of moving regions for the entire scene. By training the model to minimize view synthesis errors based on segment flow, our appearance and motion pathways learn region segmentation and flow estimation automatically without building them up from low-level edges or optical flows respectively. Our model demonstrates the surprising emergence of objectness in the appearance pathway, surpassing prior works on zero-shot object segmentation from an image, moving object segmentation from a video with unsupervised test-time adaptation, and semantic image segmentation by supervised fine-tuning. Our work is the first truly end-to-end zero-shot object segmentation from videos. It not only develops generic objectness for segmentation and tracking, but also outperforms prevalent image-based contrastive learning methods without augmentation engineering.
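The abstract describes segment flow as pooling flow offsets over each region to give one gross motion per region. The sketch below illustrates that idea only: it assumes soft region masks and a per-pixel offset field are already available, and it uses mask-weighted averaging as the pooling step. The function name `segment_flow`, the array shapes, and the choice of average pooling are assumptions made for illustration and are not taken from the authors' code.

```python
import numpy as np

def segment_flow(seg_probs, pixel_flow):
    """Pool per-pixel flow offsets over soft regions (illustrative only).

    seg_probs:  (K, H, W) soft region masks, one channel per region.
    pixel_flow: (2, H, W) per-pixel (dx, dy) offsets.
    Returns:
      region_flow: (K, 2) one mask-weighted mean offset per region.
      dense_flow:  (2, H, W) flow rebuilt by spreading each region's
                   offset back over the pixels assigned to it.
    """
    eps = 1e-8  # guards against empty regions
    mass = seg_probs.sum(axis=(1, 2))  # (K,) total mask weight per region
    # Mask-weighted sum of offsets per region, then normalize by mask weight.
    region_flow = np.einsum("khw,chw->kc", seg_probs, pixel_flow)
    region_flow = region_flow / (mass[:, None] + eps)
    # Every pixel receives the offsets of its regions, weighted by the masks.
    dense_flow = np.einsum("khw,kc->chw", seg_probs, region_flow)
    return region_flow, dense_flow
```

In this simplified form, a frame could be warped with `dense_flow` and compared against the neighboring frame to obtain a view synthesis error, which is the training signal the abstract refers to. The warping and loss are omitted here.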

Results

Task | Dataset | Metric | Value | Model
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | Dice | 0.266 | AMD
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | S-measure | 0.474 | AMD
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | Sensitivity | 0.222 | AMD
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | mean E-measure | 0.533 | AMD
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | mean F-measure | 0.146 | AMD
Medical Image Segmentation | SUN-SEG-Easy (Unseen) | weighted F-measure | 0.133 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | Dice | 0.252 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | S-measure | 0.472 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | Sensitivity | 0.213 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | mean E-measure | 0.527 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | mean F-measure | 0.141 | AMD
Medical Image Segmentation | SUN-SEG-Hard (Unseen) | weighted F-measure | 0.128 | AMD
Instance Segmentation | SegTrack-v2 | mIoU | 57 | AMD
Instance Segmentation | FBMS-59 | mIoU | 47.5 | AMD
Instance Segmentation | DAVIS 2016 | J score | 57.8 | AMD
Unsupervised Object Segmentation | SegTrack-v2 | mIoU | 57 | AMD
Unsupervised Object Segmentation | FBMS-59 | mIoU | 47.5 | AMD
Unsupervised Object Segmentation | DAVIS 2016 | J score | 57.8 | AMD
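
For readers unfamiliar with the overlap metrics in the table, here is a minimal sketch of Dice and intersection-over-union (IoU, the Jaccard index that the DAVIS J score is based on) for a single pair of binary masks. The helper name `dice_and_iou` and the convention of returning 1.0 when both masks are empty are assumptions. Each benchmark's official protocol (per-frame or per-sequence averaging, thresholding, and whether values are reported on a 0 to 1 or 0 to 100 scale) may differ from this sketch.

```python
import numpy as np

def dice_and_iou(pred, target):
    """Return (Dice, IoU) for two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    inter = np.logical_and(pred, target).sum()
    pred_sum = pred.sum()
    target_sum = target.sum()
    union = pred_sum + target_sum - inter
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0, 1.0
    dice = 2.0 * inter / (pred_sum + target_sum)
    iou = inter / union
    return float(dice), float(iou)
```

A mean IoU such as the mIoU rows above would then be an average of per-mask IoU values over a dataset, under whatever averaging rule the benchmark defines.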

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)