TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Distractor-Aware Memory for Visual Object Tracking with ...

A Distractor-Aware Memory for Visual Object Tracking with SAM2

Jovana Videnovic, Alan Lukezic, Matej Kristan

2024-11-26CVPR 2025 1Visual Object TrackingSemi-Supervised Video Object SegmentationVisual TrackingObject Tracking
PaperPDFCode(official)

Abstract

Memory-based trackers are video object segmentation methods that form the target model by concatenating recently tracked frames into a memory buffer and localize the target by attending the current image to the buffered frames. While already achieving top performance on many benchmarks, it was the recent release of SAM2 that placed memory-based trackers into focus of the visual object tracking community. Nevertheless, modern trackers still struggle in the presence of distractors. We argue that a more sophisticated memory model is required, and propose a new distractor-aware memory model for SAM2 and an introspection-based update strategy that jointly addresses the segmentation accuracy as well as tracking robustness. The resulting tracker is denoted as SAM2.1++. We also propose a new distractor-distilled DiDi dataset to study the distractor problem better. SAM2.1++ outperforms SAM2.1 and related SAM memory extensions on seven benchmarks and sets a solid new state-of-the-art on six of them.

Results

TaskDatasetMetricValueModel
VideoVOT2020EAO0.729DAM4SAM
Object TrackingLaSOTAUC75.1DAM4SAM
Object TrackingDiDiTracking quality0.694DAM4SAM
Object TrackingGOT-10kAverage Overlap81.1DAM4SAM
Object TrackingLaSOT-extAUC60.9DAM4SAM
Object TrackingVOT2022EAO0.753DAM4SAM
Video Object SegmentationVOT2020EAO0.729DAM4SAM
Semi-Supervised Video Object SegmentationVOT2020EAO0.729DAM4SAM
Visual Object TrackingLaSOTAUC75.1DAM4SAM
Visual Object TrackingDiDiTracking quality0.694DAM4SAM
Visual Object TrackingGOT-10kAverage Overlap81.1DAM4SAM
Visual Object TrackingLaSOT-extAUC60.9DAM4SAM
Visual Object TrackingVOT2022EAO0.753DAM4SAM

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-10What You Have is What You Track: Adaptive and Robust Multimodal Tracking2025-07-08Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions2025-07-01Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking2025-06-30Visual and Memory Dual Adapter for Multi-Modal Object Tracking2025-06-30