TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LSMVOS: Long-Short-Term Similarity Matching for Video Object

LSMVOS: Long-Short-Term Similarity Matching for Video Object

Zhang Xuerui, Yuan Xia

2020-09-02Semi-Supervised Video Object SegmentationOptical Flow EstimationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode

Abstract

Objective Semi-supervised video object segmentation refers to segmenting the object in subsequent frames given the object label in the first frame. Existing algorithms are mostly based on the objectives of matching and propagation strategies, which often make use of the previous frame with masking or optical flow. This paper explores a new propagation method, uses short-term matching modules to extract the information of the previous frame and apply it in propagation, and proposes the network of Long-Short-Term similarity matching for video object segmentation (LSMOVS) Method: By conducting pixel-level matching and correlation between long-term matching module and short-term matching module with the first frame and previous frame, global similarity map and local similarity map are obtained, as well as feature pattern of current frame and masking of previous frame. After two refine networks, final results are obtained through segmentation network. Results: According to the experiments on the two data sets DAVIS 2016 and 2017, the method of this paper achieves favorable average of region similarity and contour accuracy without online fine tuning, which achieves 86.5% and 77.4% in terms of single target and multiple targets. Besides, the count of segmented frames per second reached 21. Conclusion: The short-term matching module proposed in this paper is more conducive to extracting the information of the previous frame than only the mask. By combining the long-term matching module with the short-term matching module, the whole network can achieve efficient video object segmentation without online fine tuning

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Decay)15.7LSMVOS
VideoDAVIS 2017 (val)F-measure (Mean)80.8LSMVOS
VideoDAVIS 2017 (val)F-measure (Recall)91.3LSMVOS
VideoDAVIS 2017 (val)J&F77.4LSMVOS
VideoDAVIS 2017 (val)Jaccard (Decay)12.9LSMVOS
VideoDAVIS 2017 (val)Jaccard (Mean)73.9LSMVOS
VideoDAVIS 2017 (val)Jaccard (Recall)83.6LSMVOS
VideoDAVIS 2016F-measure (Decay)4.9LSMVOS
VideoDAVIS 2016F-measure (Mean)87.3LSMVOS
VideoDAVIS 2016F-measure (Recall)96.1LSMVOS
VideoDAVIS 2016J&F86.5LSMVOS
VideoDAVIS 2016Jaccard (Decay)5.1LSMVOS
VideoDAVIS 2016Jaccard (Mean)85.7LSMVOS
VideoDAVIS 2016Jaccard (Recall)97.1LSMVOS
VideoDAVIS 2017 (test-dev)F-measure (Decay)16.5LSMVOS
VideoDAVIS 2017 (test-dev)F-measure (Mean)71.2LSMVOS
VideoDAVIS 2017 (test-dev)F-measure (Recall)81.4LSMVOS
VideoDAVIS 2017 (test-dev)J&F67.4LSMVOS
VideoDAVIS 2017 (test-dev)Jaccard (Decay)16.9LSMVOS
VideoDAVIS 2017 (test-dev)Jaccard (Mean)63.7LSMVOS
VideoDAVIS 2017 (test-dev)Jaccard (Recall)72.7LSMVOS
Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)15.7LSMVOS
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)80.8LSMVOS
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)91.3LSMVOS
Video Object SegmentationDAVIS 2017 (val)J&F77.4LSMVOS
Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)12.9LSMVOS
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)73.9LSMVOS
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)83.6LSMVOS
Video Object SegmentationDAVIS 2016F-measure (Decay)4.9LSMVOS
Video Object SegmentationDAVIS 2016F-measure (Mean)87.3LSMVOS
Video Object SegmentationDAVIS 2016F-measure (Recall)96.1LSMVOS
Video Object SegmentationDAVIS 2016J&F86.5LSMVOS
Video Object SegmentationDAVIS 2016Jaccard (Decay)5.1LSMVOS
Video Object SegmentationDAVIS 2016Jaccard (Mean)85.7LSMVOS
Video Object SegmentationDAVIS 2016Jaccard (Recall)97.1LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)16.5LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)71.2LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)81.4LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)J&F67.4LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)16.9LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)63.7LSMVOS
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)72.7LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)15.7LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)80.8LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)91.3LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F77.4LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)12.9LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)73.9LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)83.6LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Decay)4.9LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)87.3LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Recall)96.1LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016J&F86.5LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Decay)5.1LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)85.7LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Recall)97.1LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)16.5LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)71.2LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)81.4LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F67.4LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)16.9LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)63.7LSMVOS
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)72.7LSMVOS

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17