TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor ...

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li

2024-03-29Video SegmentationVideo Semantic SegmentationVideo Instance Segmentation
PaperPDFCodeCodeCode(official)

Abstract

Modern video segmentation methods adopt object queries to perform inter-frame association and demonstrate satisfactory performance in tracking continuously appearing objects despite large-scale motion and transient occlusion. However, they all underperform on newly emerging and disappearing objects that are common in the real world because they attempt to model object emergence and disappearance through feature transitions between background and foreground queries that have significant feature gaps. We introduce Dynamic Anchor Queries (DAQ) to shorten the transition gap between the anchor and target queries by dynamically generating anchor queries based on the features of potential candidates. Furthermore, we introduce a query-level object Emergence and Disappearance Simulation (EDS) strategy, which unleashes DAQ's potential without any additional cost. Finally, we combine our proposed DAQ and EDS with DVIS to obtain DVIS-DAQ. Extensive experiments demonstrate that DVIS-DAQ achieves a new state-of-the-art (SOTA) performance on five mainstream video segmentation benchmarks. Code and models are available at \url{https://github.com/SkyworkAI/DAQ-VS}.

Results

TaskDatasetMetricValueModel
Video Instance SegmentationYouTube-VIS 2021AP5086.1DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationYouTube-VIS 2021AP7572.2DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationYouTube-VIS 2021AR149.6DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationYouTube-VIS 2021AR1070.7DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationYouTube-VIS 2021mask AP64.5DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationOVIS validationAP5083.8DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationOVIS validationAP7562.9DVIS-DAQ(VIT-L, Offline)
Video Instance SegmentationOVIS validationmask AP57.1DVIS-DAQ(VIT-L, Offline)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation2025-07-13MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation2025-07-10Beyond Appearance: Geometric Cues for Robust Video Instance Segmentation2025-07-08Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder2025-06-28CogGen: A Learner-Centered Generative AI Architecture for Intelligent Tutoring with Programming Video2025-06-25Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment2025-06-17A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects2025-06-16