Christoph Mayer, Martin Danelljan, Danda Pani Paudel, Luc van Gool
The presence of objects that are confusingly similar to the tracked target poses a fundamental challenge in appearance-based visual tracking. Such distractor objects are easily misclassified as the target itself, leading to eventual tracking failure. While most methods strive to suppress distractors through more powerful appearance models, we take an alternative approach. We propose to keep track of distractor objects in order to continue tracking the target. To this end, we introduce a learned association network, allowing us to propagate the identities of all target candidates from frame to frame. To tackle the lack of ground-truth correspondences between distractor objects in visual tracking, we propose a training strategy that combines partial annotations with self-supervision. We conduct comprehensive experimental validation and analysis of our approach on several challenging datasets. Our tracker sets a new state of the art on six benchmarks, achieving an AUC score of 67.1% on LaSOT and a +5.8% absolute gain on the OxUvA long-term dataset.
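To make the idea of propagating candidate identities from frame to frame concrete, the following is a minimal sketch and not the method of the paper. It replaces the learned association network with a plain greedy cosine-similarity matcher, and every name in it (`cosine`, `propagate_identities`, `min_sim`, the integer identity scheme) is a hypothetical choice made for illustration only.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a > 0 and norm_b > 0 else 0.0


def propagate_identities(prev, curr_feats, min_sim=0.5):
    """Assign an identity to every candidate in the current frame.

    prev:       dict mapping integer identity -> feature vector (previous frame)
    curr_feats: list of feature vectors for the current frame's candidates
    min_sim:    hypothetical threshold below which no match is accepted

    NOTE: greedy similarity matching is a stand-in assumption here; the
    paper uses a learned association network instead.
    """
    # Score every (previous identity, current candidate) pair, best first.
    pairs = sorted(
        ((cosine(f, g), pid, j)
         for pid, f in prev.items()
         for j, g in enumerate(curr_feats)),
        key=lambda t: t[0],
        reverse=True,
    )

    assigned = [None] * len(curr_feats)
    used = set()
    for sim, pid, j in pairs:
        if sim < min_sim:
            break  # all remaining pairs are even less similar
        if pid in used or assigned[j] is not None:
            continue  # each identity and each candidate is matched at most once
        assigned[j] = pid
        used.add(pid)

    # Unmatched candidates are treated as newly appeared objects.
    next_id = max(prev, default=-1) + 1
    for j in range(len(assigned)):
        if assigned[j] is None:
            assigned[j] = next_id
            next_id += 1
    return assigned


if __name__ == "__main__":
    previous = {0: [1.0, 0.0], 1: [0.0, 1.0]}           # e.g. target and one distractor
    current = [[0.1, 0.9], [0.9, 0.1], [-1.0, -1.0]]   # candidates in the next frame
    print(propagate_identities(previous, current))
```

In this toy setting, a candidate that inherits the target's identity is taken to be the target, while candidates that inherit other identities are known distractors and can be ruled out even when their appearance scores are high.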
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video | NT-VOT211 | AUC | 39.59 | KeepTrack |
| Video | NT-VOT211 | Precision | 55.5 | KeepTrack |
| Object Tracking | COESOT | Precision Rate | 66.1 | KeepTrack |
| Object Tracking | COESOT | Success Rate | 59.6 | KeepTrack |
| Object Tracking | UAV123 | AUC | 0.697 | KeepTrack |
| Object Tracking | LaSOT | AUC | 67.1 | KeepTrack |
| Object Tracking | LaSOT | Normalized Precision | 77.2 | KeepTrack |
| Object Tracking | LaSOT | Precision | 70.2 | KeepTrack |
| Object Tracking | DiDi | Tracking quality | 0.502 | KeepTrack |
| Object Tracking | LaSOT-ext | AUC | 48.2 | KeepTrack |
| Object Tracking | OTB-2015 | AUC | 0.709 | KeepTrack |
| Object Tracking | NT-VOT211 | AUC | 39.59 | KeepTrack |
| Object Tracking | NT-VOT211 | Precision | 55.5 | KeepTrack |
| Visual Object Tracking | UAV123 | AUC | 0.697 | KeepTrack |
| Visual Object Tracking | LaSOT | AUC | 67.1 | KeepTrack |
| Visual Object Tracking | LaSOT | Normalized Precision | 77.2 | KeepTrack |
| Visual Object Tracking | LaSOT | Precision | 70.2 | KeepTrack |
| Visual Object Tracking | DiDi | Tracking quality | 0.502 | KeepTrack |
| Visual Object Tracking | LaSOT-ext | AUC | 48.2 | KeepTrack |
| Visual Object Tracking | OTB-2015 | AUC | 0.709 | KeepTrack |