TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Putting the Object Back into Video Object Segmentation

Putting the Object Back into Video Object Segmentation

Ho Kei Cheng, Seoung Wug Oh, Brian Price, Joon-Young Lee, Alexander Schwing

2023-10-19CVPR 2024 1Visual Object TrackingSemi-Supervised Video Object SegmentationSegmentationSemantic SegmentationVideo Object SegmentationVideo Semantic Segmentation
PaperPDFCode(official)

Abstract

We present Cutie, a video object segmentation (VOS) network with object-level memory reading, which puts the object representation from memory back into the video object segmentation result. Recent works on VOS employ bottom-up pixel-level memory reading which struggles due to matching noise, especially in the presence of distractors, resulting in lower performance in more challenging data. In contrast, Cutie performs top-down object-level memory reading by adapting a small set of object queries. Via those, it interacts with the bottom-up pixel features iteratively with a query-based object transformer (qt, hence Cutie). The object queries act as a high-level summary of the target object, while high-resolution feature maps are retained for accurate segmentation. Together with foreground-background masked attention, Cutie cleanly separates the semantics of the foreground object from the background. On the challenging MOSE dataset, Cutie improves by 8.7 J&F over XMem with a similar running time and improves by 4.2 J&F over DeAOT while being three times faster. Code is available at: https://hkchengrex.github.io/Cutie

Results

TaskDatasetMetricValueModel
VideoMOSEJ&F68.3Cutie
VideoM$^3$-VOSAverage IOU74.6Cutie-base
VideoMOSEF75.8Cutie+ (base, MEGA)
VideoMOSEFPS17.9Cutie+ (base, MEGA)
VideoMOSEJ67.6Cutie+ (base, MEGA)
VideoMOSEJ&F71.7Cutie+ (base, MEGA)
VideoMOSEF74.5Cutie+ (small, MEGA)
VideoMOSEFPS20.6Cutie+ (small, MEGA)
VideoMOSEJ66Cutie+ (small, MEGA)
VideoMOSEJ&F70.3Cutie+ (small, MEGA)
VideoMOSEF74.1Cutie (base, MEGA)
VideoMOSEFPS36.4Cutie (base, MEGA)
VideoMOSEJ65.8Cutie (base, MEGA)
VideoMOSEJ&F69.9Cutie (base, MEGA)
VideoMOSEF72.9Cutie (small, MEGA)
VideoMOSEFPS45.5Cutie (small, MEGA)
VideoMOSEJ64.3Cutie (small, MEGA)
VideoMOSEJ&F68.6Cutie (small, MEGA)
VideoMOSEF72.3Cutie (base, with mose)
VideoMOSEFPS36.4Cutie (base, with mose)
VideoMOSEJ64.2Cutie (base, with mose)
VideoMOSEJ&F68.3Cutie (base, with mose)
VideoMOSEF71.7Cutie (small, with mose)
VideoMOSEFPS45.5Cutie (small, with mose)
VideoMOSEJ63.1Cutie (small, with mose)
VideoMOSEJ&F67.4Cutie (small, with mose)
VideoMOSEF67.9Cutie (base)
VideoMOSEFPS36.4Cutie (base)
VideoMOSEJ60Cutie (base)
VideoMOSEJ&F64Cutie (base)
VideoMOSEF66.2Cutie (small)
VideoMOSEFPS45.5Cutie (small)
VideoMOSEJ58.2Cutie (small)
VideoMOSEJ&F62.2Cutie (small)
VideoDAVIS 2017 (val)F-measure (Mean)93.4Cutie+ (base)
VideoDAVIS 2017 (val)J&F90.5Cutie+ (base)
VideoDAVIS 2017 (val)Jaccard (Mean)87.5Cutie+ (base)
VideoDAVIS 2017 (val)Params(M)17.9Cutie+ (base)
VideoDAVIS 2017 (val)F-measure (Mean)90.8Cutie+ (base, MEGA)
VideoDAVIS 2017 (val)J&F88.1Cutie+ (base, MEGA)
VideoDAVIS 2017 (val)Jaccard (Mean)85.5Cutie+ (base, MEGA)
VideoDAVIS 2017 (val)Speed (FPS)17.9Cutie+ (base, MEGA)
VideoDAVIS 2017 (val)F-measure (Mean)91.1Cutie (base)
VideoDAVIS 2017 (val)J&F87.9Cutie (base)
VideoDAVIS 2017 (val)Jaccard (Mean)84.6Cutie (base)
VideoDAVIS 2017 (val)Params(M)36.4Cutie (base)
VideoYouTube-VOS 2019F-Measure (Seen)90.6Cutie+ (base, MEGA)
VideoYouTube-VOS 2019F-Measure (Unseen)90.5Cutie+ (base, MEGA)
VideoYouTube-VOS 2019J&F17.9Cutie+ (base, MEGA)
VideoYouTube-VOS 2019Jaccard (Seen)86.3Cutie+ (base, MEGA)
VideoYouTube-VOS 2019Jaccard (Unseen)82.7Cutie+ (base, MEGA)
VideoYouTube-VOS 2019Overall87.5Cutie+ (base, MEGA)
VideoBURST-testHOTA (all)66Cutie (base, MEGA, 600 pixels)
VideoBURST-testHOTA (common)66.5Cutie (base, MEGA, 600 pixels)
VideoBURST-testHOTA (uncommon)65.9Cutie (base, MEGA, 600 pixels)
VideoBURST-testHOTA (all)62.6Cutie (base, with mose, 600 pixels)
VideoBURST-testHOTA (common)63.8Cutie (base, with mose, 600 pixels)
VideoBURST-testHOTA (uncommon)62.3Cutie (base, with mose, 600 pixels)
VideoDAVIS 2017 (test-dev)F-measure (Mean)91.4Cutie+ (base, MEGA)
VideoDAVIS 2017 (test-dev)FPS17.9Cutie+ (base, MEGA)
VideoDAVIS 2017 (test-dev)J&F88.1Cutie+ (base, MEGA)
VideoDAVIS 2017 (test-dev)Jaccard (Mean)84.7Cutie+ (base, MEGA)
VideoDAVIS 2017 (test-dev)F-measure (Mean)89.9Cutie (base, MEGA)
VideoDAVIS 2017 (test-dev)FPS36.4Cutie (base, MEGA)
VideoDAVIS 2017 (test-dev)J&F86.1Cutie (base, MEGA)
VideoDAVIS 2017 (test-dev)Jaccard (Mean)82.4Cutie (base, MEGA)
VideoDAVIS 2017 (test-dev)F-measure (Mean)89.2Cutie+ (base)
VideoDAVIS 2017 (test-dev)FPS17.9Cutie+ (base)
VideoDAVIS 2017 (test-dev)J&F85.9Cutie+ (base)
VideoDAVIS 2017 (test-dev)Jaccard (Mean)82.6Cutie+ (base)
VideoYouTube-VOS 2018F-Measure (Seen)91Cutie+ (base, MEGA)
VideoYouTube-VOS 2018F-Measure (Unseen)90.1Cutie+ (base, MEGA)
VideoYouTube-VOS 2018Jaccard (Seen)86.6Cutie+ (base, MEGA)
VideoYouTube-VOS 2018Jaccard (Unseen)82.2Cutie+ (base, MEGA)
VideoYouTube-VOS 2018Overall87.5Cutie+ (base, MEGA)
VideoYouTube-VOS 2018Speed (FPS)17.9Cutie+ (base, MEGA)
VideoBURST-valHOTA (all)61.2Cutie (base, MEGA, 600 pixels)
VideoBURST-valHOTA (common)65Cutie (base, MEGA, 600 pixels)
VideoBURST-valHOTA (uncommon)60.3Cutie (base, MEGA, 600 pixels)
VideoBURST-valHOTA (all)58.4Cutie (base, with mose, 600 pixels)
VideoBURST-valHOTA (common)61.8Cutie (base, with mose, 600 pixels)
VideoBURST-valHOTA (uncommon)57.5Cutie (base, with mose, 600 pixels)
Object TrackingDiDiTracking quality0.575Cutie
Video Object SegmentationMOSEJ&F68.3Cutie
Video Object SegmentationM$^3$-VOSAverage IOU74.6Cutie-base
Video Object SegmentationMOSEF75.8Cutie+ (base, MEGA)
Video Object SegmentationMOSEFPS17.9Cutie+ (base, MEGA)
Video Object SegmentationMOSEJ67.6Cutie+ (base, MEGA)
Video Object SegmentationMOSEJ&F71.7Cutie+ (base, MEGA)
Video Object SegmentationMOSEF74.5Cutie+ (small, MEGA)
Video Object SegmentationMOSEFPS20.6Cutie+ (small, MEGA)
Video Object SegmentationMOSEJ66Cutie+ (small, MEGA)
Video Object SegmentationMOSEJ&F70.3Cutie+ (small, MEGA)
Video Object SegmentationMOSEF74.1Cutie (base, MEGA)
Video Object SegmentationMOSEFPS36.4Cutie (base, MEGA)
Video Object SegmentationMOSEJ65.8Cutie (base, MEGA)
Video Object SegmentationMOSEJ&F69.9Cutie (base, MEGA)
Video Object SegmentationMOSEF72.9Cutie (small, MEGA)
Video Object SegmentationMOSEFPS45.5Cutie (small, MEGA)
Video Object SegmentationMOSEJ64.3Cutie (small, MEGA)
Video Object SegmentationMOSEJ&F68.6Cutie (small, MEGA)
Video Object SegmentationMOSEF72.3Cutie (base, with mose)
Video Object SegmentationMOSEFPS36.4Cutie (base, with mose)
Video Object SegmentationMOSEJ64.2Cutie (base, with mose)
Video Object SegmentationMOSEJ&F68.3Cutie (base, with mose)
Video Object SegmentationMOSEF71.7Cutie (small, with mose)
Video Object SegmentationMOSEFPS45.5Cutie (small, with mose)
Video Object SegmentationMOSEJ63.1Cutie (small, with mose)
Video Object SegmentationMOSEJ&F67.4Cutie (small, with mose)
Video Object SegmentationMOSEF67.9Cutie (base)
Video Object SegmentationMOSEFPS36.4Cutie (base)
Video Object SegmentationMOSEJ60Cutie (base)
Video Object SegmentationMOSEJ&F64Cutie (base)
Video Object SegmentationMOSEF66.2Cutie (small)
Video Object SegmentationMOSEFPS45.5Cutie (small)
Video Object SegmentationMOSEJ58.2Cutie (small)
Video Object SegmentationMOSEJ&F62.2Cutie (small)
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)93.4Cutie+ (base)
Video Object SegmentationDAVIS 2017 (val)J&F90.5Cutie+ (base)
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)87.5Cutie+ (base)
Video Object SegmentationDAVIS 2017 (val)Params(M)17.9Cutie+ (base)
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)90.8Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (val)J&F88.1Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)85.5Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (val)Speed (FPS)17.9Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)91.1Cutie (base)
Video Object SegmentationDAVIS 2017 (val)J&F87.9Cutie (base)
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)84.6Cutie (base)
Video Object SegmentationDAVIS 2017 (val)Params(M)36.4Cutie (base)
Video Object SegmentationYouTube-VOS 2019F-Measure (Seen)90.6Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2019F-Measure (Unseen)90.5Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2019J&F17.9Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2019Jaccard (Seen)86.3Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2019Jaccard (Unseen)82.7Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2019Overall87.5Cutie+ (base, MEGA)
Video Object SegmentationBURST-testHOTA (all)66Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-testHOTA (common)66.5Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-testHOTA (uncommon)65.9Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-testHOTA (all)62.6Cutie (base, with mose, 600 pixels)
Video Object SegmentationBURST-testHOTA (common)63.8Cutie (base, with mose, 600 pixels)
Video Object SegmentationBURST-testHOTA (uncommon)62.3Cutie (base, with mose, 600 pixels)
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)91.4Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)FPS17.9Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)J&F88.1Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)84.7Cutie+ (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)89.9Cutie (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)FPS36.4Cutie (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)J&F86.1Cutie (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)82.4Cutie (base, MEGA)
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)89.2Cutie+ (base)
Video Object SegmentationDAVIS 2017 (test-dev)FPS17.9Cutie+ (base)
Video Object SegmentationDAVIS 2017 (test-dev)J&F85.9Cutie+ (base)
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)82.6Cutie+ (base)
Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)91Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)90.1Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)86.6Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)82.2Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2018Overall87.5Cutie+ (base, MEGA)
Video Object SegmentationYouTube-VOS 2018Speed (FPS)17.9Cutie+ (base, MEGA)
Video Object SegmentationBURST-valHOTA (all)61.2Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-valHOTA (common)65Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-valHOTA (uncommon)60.3Cutie (base, MEGA, 600 pixels)
Video Object SegmentationBURST-valHOTA (all)58.4Cutie (base, with mose, 600 pixels)
Video Object SegmentationBURST-valHOTA (common)61.8Cutie (base, with mose, 600 pixels)
Video Object SegmentationBURST-valHOTA (uncommon)57.5Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationMOSEF75.8Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEFPS17.9Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ67.6Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ&F71.7Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEF74.5Cutie+ (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEFPS20.6Cutie+ (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ66Cutie+ (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ&F70.3Cutie+ (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEF74.1Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEFPS36.4Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ65.8Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ&F69.9Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationMOSEF72.9Cutie (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEFPS45.5Cutie (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ64.3Cutie (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEJ&F68.6Cutie (small, MEGA)
Semi-Supervised Video Object SegmentationMOSEF72.3Cutie (base, with mose)
Semi-Supervised Video Object SegmentationMOSEFPS36.4Cutie (base, with mose)
Semi-Supervised Video Object SegmentationMOSEJ64.2Cutie (base, with mose)
Semi-Supervised Video Object SegmentationMOSEJ&F68.3Cutie (base, with mose)
Semi-Supervised Video Object SegmentationMOSEF71.7Cutie (small, with mose)
Semi-Supervised Video Object SegmentationMOSEFPS45.5Cutie (small, with mose)
Semi-Supervised Video Object SegmentationMOSEJ63.1Cutie (small, with mose)
Semi-Supervised Video Object SegmentationMOSEJ&F67.4Cutie (small, with mose)
Semi-Supervised Video Object SegmentationMOSEF67.9Cutie (base)
Semi-Supervised Video Object SegmentationMOSEFPS36.4Cutie (base)
Semi-Supervised Video Object SegmentationMOSEJ60Cutie (base)
Semi-Supervised Video Object SegmentationMOSEJ&F64Cutie (base)
Semi-Supervised Video Object SegmentationMOSEF66.2Cutie (small)
Semi-Supervised Video Object SegmentationMOSEFPS45.5Cutie (small)
Semi-Supervised Video Object SegmentationMOSEJ58.2Cutie (small)
Semi-Supervised Video Object SegmentationMOSEJ&F62.2Cutie (small)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)93.4Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F90.5Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)87.5Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Params(M)17.9Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)90.8Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F88.1Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)85.5Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Speed (FPS)17.9Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)91.1Cutie (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F87.9Cutie (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)84.6Cutie (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Params(M)36.4Cutie (base)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019F-Measure (Seen)90.6Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019F-Measure (Unseen)90.5Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019J&F17.9Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019Jaccard (Seen)86.3Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019Jaccard (Unseen)82.7Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2019Overall87.5Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationBURST-testHOTA (all)66Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-testHOTA (common)66.5Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-testHOTA (uncommon)65.9Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-testHOTA (all)62.6Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-testHOTA (common)63.8Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-testHOTA (uncommon)62.3Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)91.4Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)FPS17.9Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F88.1Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)84.7Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)89.9Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)FPS36.4Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F86.1Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)82.4Cutie (base, MEGA)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)89.2Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)FPS17.9Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F85.9Cutie+ (base)
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)82.6Cutie+ (base)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Seen)91Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018F-Measure (Unseen)90.1Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Seen)86.6Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Jaccard (Unseen)82.2Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Overall87.5Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationYouTube-VOS 2018Speed (FPS)17.9Cutie+ (base, MEGA)
Semi-Supervised Video Object SegmentationBURST-valHOTA (all)61.2Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-valHOTA (common)65Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-valHOTA (uncommon)60.3Cutie (base, MEGA, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-valHOTA (all)58.4Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-valHOTA (common)61.8Cutie (base, with mose, 600 pixels)
Semi-Supervised Video Object SegmentationBURST-valHOTA (uncommon)57.5Cutie (base, with mose, 600 pixels)
Visual Object TrackingDiDiTracking quality0.575Cutie

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17