TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fast Online Object Tracking and Segmentation: A Unifying A...

Fast Online Object Tracking and Segmentation: A Unifying Approach

Qiang Wang, Li Zhang, Luca Bertinetto, Weiming Hu, Philip H. S. Torr

2018-12-12CVPR 2019 6Visual Object TrackingSemi-Supervised Video Object SegmentationVisual TrackingReal-Time Visual TrackingSemi-Supervised Semantic SegmentationSegmentationVideo Object SegmentationObject TrackingVideo Object Tracking
PaperPDFCodeCodeCode

Abstract

In this paper we illustrate how to perform both visual object tracking and semi-supervised video object segmentation, in real-time, with a single simple approach. Our method, dubbed SiamMask, improves the offline training procedure of popular fully-convolutional Siamese approaches for object tracking by augmenting their loss with a binary segmentation task. Once trained, SiamMask solely relies on a single bounding box initialisation and operates online, producing class-agnostic object segmentation masks and rotated bounding boxes at 55 frames per second. Despite its simplicity, versatility and fast speed, our strategy allows us to establish a new state of the art among real-time trackers on VOT-2018, while at the same time demonstrating competitive performance and the best speed for the semi-supervised video object segmentation task on DAVIS-2016 and DAVIS-2017. The project website is http://www.robots.ox.ac.uk/~qwang/SiamMask.

Results

TaskDatasetMetricValueModel
VideoDAVIS 2017 (val)F-measure (Decay)20.9SiamMask
VideoDAVIS 2017 (val)F-measure (Mean)58.5SiamMask
VideoDAVIS 2017 (val)F-measure (Recall)67.5SiamMask
VideoDAVIS 2017 (val)J&F56.4SiamMask
VideoDAVIS 2017 (val)Jaccard (Decay)19.3SiamMask
VideoDAVIS 2017 (val)Jaccard (Mean)54.3SiamMask
VideoDAVIS 2017 (val)Jaccard (Recall)62.8SiamMask
VideoDAVIS 2016F-measure (Decay)2.1SiamMask
VideoDAVIS 2016F-measure (Mean)67.8SiamMask
VideoDAVIS 2016F-measure (Recall)79.8SiamMask
VideoDAVIS 2016J&F69.75SiamMask
VideoDAVIS 2016Jaccard (Decay)3SiamMask
VideoDAVIS 2016Jaccard (Mean)71.7SiamMask
VideoDAVIS 2016Jaccard (Recall)86.8SiamMask
VideoDAVIS 2017 (test-dev)F-measure (Decay)22.4SiamMask
VideoDAVIS 2017 (test-dev)F-measure (Mean)45.8SiamMask
VideoDAVIS 2017 (test-dev)F-measure (Recall)45.3SiamMask
VideoDAVIS 2017 (test-dev)J&F43.2SiamMask
VideoDAVIS 2017 (test-dev)Jaccard (Decay)21.9SiamMask
VideoDAVIS 2017 (test-dev)Jaccard (Mean)40.6SiamMask
VideoDAVIS 2017 (test-dev)Jaccard (Recall)44.5SiamMask
VideoNT-VOT211AUC35.14SiamMask
VideoNT-VOT211Precision46.49SiamMask
Object TrackingVOT2017/18Expected Average Overlap (EAO)0.38SiamMask
Object TrackingYouTube-VOS 2018F-Measure (Seen)58.2SiamMask
Object TrackingYouTube-VOS 2018F-Measure (Unseen)47.7SiamMask
Object TrackingYouTube-VOS 2018Jaccard (Seen)54.3SiamMask
Object TrackingYouTube-VOS 2018Jaccard (Unseen)45.1SiamMask
Object TrackingYouTube-VOS 2018O (Average of Measures)52.8SiamMask
Object TrackingNT-VOT211AUC35.14SiamMask
Object TrackingNT-VOT211Precision46.49SiamMask
Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)20.9SiamMask
Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)58.5SiamMask
Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)67.5SiamMask
Video Object SegmentationDAVIS 2017 (val)J&F56.4SiamMask
Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)19.3SiamMask
Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)54.3SiamMask
Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)62.8SiamMask
Video Object SegmentationDAVIS 2016F-measure (Decay)2.1SiamMask
Video Object SegmentationDAVIS 2016F-measure (Mean)67.8SiamMask
Video Object SegmentationDAVIS 2016F-measure (Recall)79.8SiamMask
Video Object SegmentationDAVIS 2016J&F69.75SiamMask
Video Object SegmentationDAVIS 2016Jaccard (Decay)3SiamMask
Video Object SegmentationDAVIS 2016Jaccard (Mean)71.7SiamMask
Video Object SegmentationDAVIS 2016Jaccard (Recall)86.8SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)22.4SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)45.8SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)45.3SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)J&F43.2SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)21.9SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)40.6SiamMask
Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)44.5SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Decay)20.9SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Mean)58.5SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)F-measure (Recall)67.5SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)J&F56.4SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Decay)19.3SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Mean)54.3SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (val)Jaccard (Recall)62.8SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Decay)2.1SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Mean)67.8SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016F-measure (Recall)79.8SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016J&F69.75SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Decay)3SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Mean)71.7SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2016Jaccard (Recall)86.8SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Decay)22.4SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Mean)45.8SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)F-measure (Recall)45.3SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)J&F43.2SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Decay)21.9SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Mean)40.6SiamMask
Semi-Supervised Video Object SegmentationDAVIS 2017 (test-dev)Jaccard (Recall)44.5SiamMask
Visual Object TrackingVOT2017/18Expected Average Overlap (EAO)0.38SiamMask
Visual Object TrackingYouTube-VOS 2018F-Measure (Seen)58.2SiamMask
Visual Object TrackingYouTube-VOS 2018F-Measure (Unseen)47.7SiamMask
Visual Object TrackingYouTube-VOS 2018Jaccard (Seen)54.3SiamMask
Visual Object TrackingYouTube-VOS 2018Jaccard (Unseen)45.1SiamMask
Visual Object TrackingYouTube-VOS 2018O (Average of Measures)52.8SiamMask

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17