DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li

2024-03-29Video Segmentation Video Semantic Segmentation Video Instance Segmentation

Abstract

Modern video segmentation methods adopt object queries to perform inter-frame association and demonstrate satisfactory performance in tracking continuously appearing objects despite large-scale motion and transient occlusion. However, they all underperform on newly emerging and disappearing objects that are common in the real world because they attempt to model object emergence and disappearance through feature transitions between background and foreground queries that have significant feature gaps. We introduce Dynamic Anchor Queries (DAQ) to shorten the transition gap between the anchor and target queries by dynamically generating anchor queries based on the features of potential candidates. Furthermore, we introduce a query-level object Emergence and Disappearance Simulation (EDS) strategy, which unleashes DAQ's potential without any additional cost. Finally, we combine our proposed DAQ and EDS with DVIS to obtain DVIS-DAQ. Extensive experiments demonstrate that DVIS-DAQ achieves a new state-of-the-art (SOTA) performance on five mainstream video segmentation benchmarks. Code and models are available at \url{https://github.com/SkyworkAI/DAQ-VS}.

Results

Task	Dataset	Metric	Value	Model
Video Instance Segmentation	YouTube-VIS 2021	AP50	86.1	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	YouTube-VIS 2021	AP75	72.2	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	YouTube-VIS 2021	AR1	49.6	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	YouTube-VIS 2021	AR10	70.7	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	YouTube-VIS 2021	mask AP	64.5	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	OVIS validation	AP50	83.8	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	OVIS validation	AP75	62.9	DVIS-DAQ(VIT-L, Offline)
Video Instance Segmentation	OVIS validation	mask AP	57.1	DVIS-DAQ(VIT-L, Offline)

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Abstract

Results

Related Papers

DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries

Abstract

Results

Related Papers