Dense Unsupervised Learning for Video Segmentation

Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

2021-11-11NeurIPS 2021 12Unsupervised Video Object Segmentation Semi-Supervised Video Object Segmentation Segmentation Semantic Segmentation Video Segmentation Video Object Segmentation Video Semantic Segmentation

Paper PDF Code(official)

Abstract

We present a novel approach to unsupervised learning for video object segmentation (VOS). Unlike previous work, our formulation allows to learn dense feature representations directly in a fully convolutional regime. We rely on uniform grid sampling to extract a set of anchors and train our model to disambiguate between them on both inter- and intra-video levels. However, a naive scheme to train such a model results in a degenerate solution. We propose to prevent this with a simple regularisation scheme, accommodating the equivariance property of the segmentation task to similarity transformations. Our training objective admits efficient implementation and exhibits fast training convergence. On established VOS benchmarks, our approach exceeds the segmentation accuracy of previous work despite using significantly less training data and compute power.

Results

Task	Dataset	Metric	Value	Model
Video	DAVIS 2017 (val)	F-measure (Mean)	71.7	Araslanov et al.
Video	DAVIS 2017 (val)	F-measure (Recall)	84.8	Araslanov et al.
Video	DAVIS 2017 (val)	J&F	69.4	Araslanov et al.
Video	DAVIS 2017 (val)	Jaccard (Mean)	67.1	Araslanov et al.
Video	DAVIS 2017 (val)	Jaccard (Recall)	80.9	Araslanov et al.
Video Object Segmentation	DAVIS 2017 (val)	F-measure (Mean)	71.7	Araslanov et al.
Video Object Segmentation	DAVIS 2017 (val)	F-measure (Recall)	84.8	Araslanov et al.
Video Object Segmentation	DAVIS 2017 (val)	J&F	69.4	Araslanov et al.
Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Mean)	67.1	Araslanov et al.
Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Recall)	80.9	Araslanov et al.
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	F-measure (Mean)	71.7	Araslanov et al.
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	F-measure (Recall)	84.8	Araslanov et al.
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	J&F	69.4	Araslanov et al.
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Mean)	67.1	Araslanov et al.
Semi-Supervised Video Object Segmentation	DAVIS 2017 (val)	Jaccard (Recall)	80.9	Araslanov et al.

Dense Unsupervised Learning for Video Segmentation

Abstract

Results

Related Papers

Dense Unsupervised Learning for Video Segmentation

Abstract

Results

Related Papers