TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ARTrackV2: Prompting Autoregressive Tracker Where to Look ...

ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe

Yifan Bai, Zeyang Zhao, Yihong Gong, Xing Wei

2023-12-28CVPR 2024 1Visual Object TrackingObject TrackingVideo Object Tracking
PaperPDFCode(official)

Abstract

We present ARTrackV2, which integrates two pivotal aspects of tracking: determining where to look (localization) and how to describe (appearance analysis) the target object across video frames. Building on the foundation of its predecessor, ARTrackV2 extends the concept by introducing a unified generative framework to "read out" object's trajectory and "retell" its appearance in an autoregressive manner. This approach fosters a time-continuous methodology that models the joint evolution of motion and visual features, guided by previous estimates. Furthermore, ARTrackV2 stands out for its efficiency and simplicity, obviating the less efficient intra-frame autoregression and hand-tuned parameters for appearance updates. Despite its simplicity, ARTrackV2 achieves state-of-the-art performance on prevailing benchmark datasets while demonstrating remarkable efficiency improvement. In particular, ARTrackV2 achieves AO score of 79.5\% on GOT-10k, and AUC of 86.1\% on TrackingNet while being $3.6 \times$ faster than ARTrack. The code will be released.

Results

TaskDatasetMetricValueModel
Object TrackingTNL2KAUC61.6ARTrackV2-L
Object TrackingUAV123AUC0.717ARTrackV2-L
Object TrackingLaSOTAUC73.6ARTrackV2-L
Object TrackingLaSOTNormalized Precision82.8ARTrackV2-L
Object TrackingLaSOTPrecision81.1ARTrackV2-L
Object TrackingNeedForSpeedAUC0.684ARTrackV2-L
Object TrackingGOT-10kAverage Overlap79.5ARTrackV2-L
Object TrackingGOT-10kSuccess Rate 0.587.8ARTrackV2-L
Object TrackingGOT-10kSuccess Rate 0.7579.6ARTrackV2-L
Object TrackingLaSOT-extAUC53.4ARTrackV2-L
Object TrackingLaSOT-extNormalized Precision63.7ARTrackV2-L
Object TrackingLaSOT-extPrecision60.2ARTrackV2-L
Object TrackingTrackingNetAccuracy86.1ARTrackV2-L
Object TrackingTrackingNetNormalized Precision90.4ARTrackV2-L
Object TrackingTrackingNetPrecision86.2ARTrackV2-L
Visual Object TrackingTNL2KAUC61.6ARTrackV2-L
Visual Object TrackingUAV123AUC0.717ARTrackV2-L
Visual Object TrackingLaSOTAUC73.6ARTrackV2-L
Visual Object TrackingLaSOTNormalized Precision82.8ARTrackV2-L
Visual Object TrackingLaSOTPrecision81.1ARTrackV2-L
Visual Object TrackingNeedForSpeedAUC0.684ARTrackV2-L
Visual Object TrackingGOT-10kAverage Overlap79.5ARTrackV2-L
Visual Object TrackingGOT-10kSuccess Rate 0.587.8ARTrackV2-L
Visual Object TrackingGOT-10kSuccess Rate 0.7579.6ARTrackV2-L
Visual Object TrackingLaSOT-extAUC53.4ARTrackV2-L
Visual Object TrackingLaSOT-extNormalized Precision63.7ARTrackV2-L
Visual Object TrackingLaSOT-extPrecision60.2ARTrackV2-L
Visual Object TrackingTrackingNetAccuracy86.1ARTrackV2-L
Visual Object TrackingTrackingNetNormalized Precision90.4ARTrackV2-L
Visual Object TrackingTrackingNetPrecision86.2ARTrackV2-L

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-10Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions2025-07-01Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking2025-06-30Visual and Memory Dual Adapter for Multi-Modal Object Tracking2025-06-30R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning2025-06-27