TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Discriminative Feature with CRF for Unsupervised ...

Learning Discriminative Feature with CRF for Unsupervised Video Object Segmentation

Mingmin Zhen, Shiwei Li, Lei Zhou, Jiaxiang Shang, Haoan Feng, Tian Fang, Long Quan

2020-08-04ECCV 2020 8Unsupervised Video Object SegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic SegmentationRGB Salient Object Detection
PaperPDF

Abstract

In this paper, we introduce a novel network, called discriminative feature network (DFNet), to address the unsupervised video object segmentation task. To capture the inherent correlation among video frames, we learn discriminative features (D-features) from the input images that reveal feature distribution from a global perspective. The D-features are then used to establish correspondence with all features of test image under conditional random field (CRF) formulation, which is leveraged to enforce consistency between pixels. The experiments verify that DFNet outperforms state-of-the-art methods by a large margin with a mean IoU score of 83.4% and ranks first on the DAVIS-2016 leaderboard while using much fewer parameters and achieving much more efficient performance in the inference phase. We further evaluate DFNet on the FBMS dataset and the video saliency dataset ViSal, reaching a new state-of-the-art. To further demonstrate the generalizability of our framework, DFNet is also applied to the image object co-segmentation task. We perform experiments on a challenging dataset PASCAL-VOC and observe the superiority of DFNet. The thorough experiments verify that DFNet is able to capture and mine the underlying relations of images and discover the common foreground objects.

Results

TaskDatasetMetricValueModel
VideoFBMSF-Score82.3DFNet
VideoDAVIS 2016F-Score81.8DFNet
VideoDAVIS 2016Jaccard (Mean)83.4Ours
VideoDAVIS 2016 valF81.8DFNet
VideoDAVIS 2016 valG82.6DFNet
VideoDAVIS 2016 valJ83.4DFNet
Video Object SegmentationFBMSF-Score82.3DFNet
Video Object SegmentationDAVIS 2016F-Score81.8DFNet
Video Object SegmentationDAVIS 2016Jaccard (Mean)83.4Ours
Video Object SegmentationDAVIS 2016 valF81.8DFNet
Video Object SegmentationDAVIS 2016 valG82.6DFNet
Video Object SegmentationDAVIS 2016 valJ83.4DFNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15