TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Treating Motion as Option to Reduce Motion Dependency in U...

Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation

Suhwan Cho, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee

2022-09-04Unsupervised Video Object SegmentationOptical Flow EstimationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)Code

Abstract

Unsupervised video object segmentation (VOS) aims to detect the most salient object in a video sequence at the pixel level. In unsupervised VOS, most state-of-the-art methods leverage motion cues obtained from optical flow maps in addition to appearance cues to exploit the property that salient objects usually have distinctive movements compared to the background. However, as they are overly dependent on motion cues, which may be unreliable in some cases, they cannot achieve stable prediction. To reduce this motion dependency of existing two-stream VOS methods, we propose a novel motion-as-option network that optionally utilizes motion cues. Additionally, to fully exploit the property of the proposed network that motion is not always required, we introduce a collaborative network learning strategy. On all the public benchmark datasets, our proposed network affords state-of-the-art performance with real-time inference speed.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2016 valF87.8TMO (MiT-b1)
VideoDAVIS 2016 valG87.2TMO (MiT-b1)
VideoDAVIS 2016 valJ86.6TMO (MiT-b1)
VideoDAVIS 2016 valF86.6TMO (RN-101)
VideoDAVIS 2016 valG86.1TMO (RN-101)
VideoDAVIS 2016 valJ85.6TMO (RN-101)
VideoYouTube-ObjectsJ71.5TMO (RN-101)
VideoYouTube-ObjectsJ71.1TMO (MiT-b1)
VideoFBMS testJ80TMO (MiT-b1)
VideoFBMS testJ79.9TMO (RN-101)
Video Object SegmentationDAVIS 2016 valF87.8TMO (MiT-b1)
Video Object SegmentationDAVIS 2016 valG87.2TMO (MiT-b1)
Video Object SegmentationDAVIS 2016 valJ86.6TMO (MiT-b1)
Video Object SegmentationDAVIS 2016 valF86.6TMO (RN-101)
Video Object SegmentationDAVIS 2016 valG86.1TMO (RN-101)
Video Object SegmentationDAVIS 2016 valJ85.6TMO (RN-101)
Video Object SegmentationYouTube-ObjectsJ71.5TMO (RN-101)
Video Object SegmentationYouTube-ObjectsJ71.1TMO (MiT-b1)
Video Object SegmentationFBMS testJ80TMO (MiT-b1)
Video Object SegmentationFBMS testJ79.9TMO (RN-101)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15