TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Memory Matching is not Enough: Jointly Improving Memory Ma...

Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation

Jintu Zheng, Yun Liang, Yuqing Zhang, Wanchao Su

2024-09-22Semi-Supervised Video Object SegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDF

Abstract

Memory-based video object segmentation methods model multiple objects over long temporal-spatial spans by establishing memory bank, which achieve the remarkable performance. However, they struggle to overcome the false matching and are prone to lose critical information, resulting in confusion among different objects. In this paper, we propose an effective approach which jointly improving the matching and decoding stages to alleviate the false matching issue.For the memory matching stage, we present a cost aware mechanism that suppresses the slight errors for short-term memory and a shunted cross-scale matching for long-term memory which establish a wide filed matching spaces for various object scales. For the readout decoding stage, we implement a compensatory mechanism aims at recovering the essential information where missing at the matching stage. Our approach achieves the outstanding performance in several popular benchmarks (i.e., DAVIS 2016&2017 Val (92.4%&88.1%), and DAVIS 2017 Test (83.9%)), and achieves 84.8%&84.6% on YouTubeVOS 2018&2019 Val.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Mean)91JIMD
VideoDAVIS 2017 (val)J&F88.1JIMD
VideoDAVIS 2017 (val)Jaccard (Mean)85.2JIMD
VideoDAVIS 2017 (test-dev)F-measure (Mean)87.4JIMD-R50
VideoDAVIS 2017 (test-dev)J&F83.9JIMD-R50
VideoDAVIS 2017 (test-dev)Jaccard (Mean)80.3JIMD-R50
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)91JIMD
Video Object SegmentationDAVIS 2017 (val)J&F88.1JIMD
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)85.2JIMD
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)87.4JIMD-R50
Video Object SegmentationDAVIS 2017 (test-dev)J&F83.9JIMD-R50
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)80.3JIMD-R50
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)91JIMD
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F88.1JIMD
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)85.2JIMD
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)87.4JIMD-R50
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F83.9JIMD-R50
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)80.3JIMD-R50

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15