TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Tackling Background Distraction in Video Object Segmentation

Tackling Background Distraction in Video Object Segmentation

Suhwan Cho, Heansung Lee, Minhyeok Lee, Chaewon Park, Sungjun Jang, Minjung Kim, Sangyoun Lee

2022-07-14Semi-Supervised Video Object SegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

Semi-supervised video object segmentation (VOS) aims to densely track certain designated objects in videos. One of the main challenges in this task is the existence of background distractors that appear similar to the target objects. We propose three novel strategies to suppress such distractors: 1) a spatio-temporally diversified template construction scheme to obtain generalized properties of the target objects; 2) a learnable distance-scoring function to exclude spatially-distant distractors by exploiting the temporal consistency between two consecutive frames; 3) swap-and-attach augmentation to force each object to have unique features by providing training samples containing entangled objects. On all public benchmark datasets, our model achieves a comparable performance to contemporary state-of-the-art approaches, even with real-time performance. Qualitative results also demonstrate the superiority of our approach over existing methods. We believe our approach will be widely used for future VOS research.

Results

TaskDatasetMetricValueModel
VideoDAVIS (no YouTube-VOS training)D16 val (F)86.2TBD
VideoDAVIS (no YouTube-VOS training)D16 val (G)86.8TBD
VideoDAVIS (no YouTube-VOS training)D16 val (J)87.5TBD
VideoDAVIS (no YouTube-VOS training)D17 test (F)72.2TBD
VideoDAVIS (no YouTube-VOS training)D17 test (G)69.4TBD
VideoDAVIS (no YouTube-VOS training)D17 test (J)66.6TBD
VideoDAVIS (no YouTube-VOS training)D17 val (F)82.3TBD
VideoDAVIS (no YouTube-VOS training)D17 val (G)80TBD
VideoDAVIS (no YouTube-VOS training)D17 val (J)77.6TBD
VideoDAVIS (no YouTube-VOS training)FPS50.1TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (F)86.2TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (G)86.8TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (J)87.5TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (F)72.2TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (G)69.4TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (J)66.6TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)82.3TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)80TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)77.6TBD
Video Object SegmentationDAVIS (no YouTube-VOS training)FPS50.1TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (F)86.2TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (G)86.8TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D16 val (J)87.5TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (F)72.2TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (G)69.4TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 test (J)66.6TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (F)82.3TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (G)80TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)D17 val (J)77.6TBD
Semi-Supervised Video Object SegmentationDAVIS (no YouTube-VOS training)FPS50.1TBD

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15