TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/See More, Know More: Unsupervised Video Object Segmentatio...

See More, Know More: Unsupervised Video Object Segmentation with Co-Attention Siamese Networks

Xiankai Lu, Wenguan Wang, Chao Ma, Jianbing Shen, Ling Shao, Fatih Porikli

2020-01-19CVPR 2019 6Unsupervised Video Object SegmentationVideo Polyp SegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

We introduce a novel network, called CO-attention Siamese Network (COSNet), to address the unsupervised video object segmentation task from a holistic view. We emphasize the importance of inherent correlation among video frames and incorporate a global co-attention mechanism to improve further the state-of-the-art deep learning based solutions that primarily focus on learning discriminative foreground representations over appearance and motion in short-term temporal segments. The co-attention layers in our network provide efficient and competent stages for capturing global correlations and scene context by jointly computing and appending co-attention responses into a joint feature space. We train COSNet with pairs of video frames, which naturally augments training data and allows increased learning capacity. During the segmentation stage, the co-attention model encodes useful information by processing multiple reference frames together, which is leveraged to infer the frequently reappearing and salient foreground objects better. We propose a unified and end-to-end trainable framework where different co-attention variants can be derived for mining the rich context within videos. Our extensive experiments over three large benchmarks manifest that COSNet outperforms the current alternatives by a large margin.

Results

TaskDatasetMetricValueModel
Medical Image SegmentationSUN-SEG-Easy (Unseen)Dice0.596COSNet
Medical Image SegmentationSUN-SEG-Easy (Unseen)S measure0.654COSNet
Medical Image SegmentationSUN-SEG-Easy (Unseen)Sensitivity0.359COSNet
Medical Image SegmentationSUN-SEG-Easy (Unseen)mean E-measure0.6COSNet
Medical Image SegmentationSUN-SEG-Easy (Unseen)mean F-measure0.496COSNet
Medical Image SegmentationSUN-SEG-Easy (Unseen)weighted F-measure0.431COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)Dice0.606COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)S-Measure0.67COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)Sensitivity0.38COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)mean E-measure0.627COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)mean F-measure0.506COSNet
Medical Image SegmentationSUN-SEG-Hard (Unseen)weighted F-measure0.443COSNet
VideoDAVIS 2016 valF79.4COSNet
VideoDAVIS 2016 valG80COSNet
VideoDAVIS 2016 valJ80.5COSNet
VideoYouTube-ObjectsJ70.5COSNet
VideoFBMS testJ75.6COSNet
Video Object SegmentationDAVIS 2016 valF79.4COSNet
Video Object SegmentationDAVIS 2016 valG80COSNet
Video Object SegmentationDAVIS 2016 valJ80.5COSNet
Video Object SegmentationYouTube-ObjectsJ70.5COSNet
Video Object SegmentationFBMS testJ75.6COSNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15