TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Joint Feature Learning and Relation Modeling for Tracking:...

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework

Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, Xilin Chen

2022-03-22Visual Object TrackingVisual TrackingObject TrackingVideo Object Tracking
PaperPDFCode(official)

Abstract

The current popular two-stream, two-stage tracking framework extracts the template and the search region features separately and then performs relation modeling, thus the extracted features lack the awareness of the target and have limited target-background discriminability. To tackle the above issue, we propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling by bridging the template-search image pairs with bidirectional information flows. In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance. Since no extra heavy relation modeling module is needed and the implementation is highly parallelized, the proposed tracker runs at a fast speed. To further improve the inference efficiency, an in-network candidate early elimination module is proposed based on the strong similarity prior calculated in the one-stream framework. As a unified framework, OSTrack achieves state-of-the-art performance on multiple benchmarks, in particular, it shows impressive results on the one-shot tracking benchmark GOT-10k, i.e., achieving 73.7% AO, improving the existing best result (SwinTrack) by 4.3\%. Besides, our method maintains a good performance-speed trade-off and shows faster convergence. The code and models are available at https://github.com/botaoye/OSTrack.

Results

TaskDatasetMetricValueModel
VideoNT-VOT211AUC38.59OSTrack-384
VideoNT-VOT211Precision53.06OSTrack-384
Visual TrackingTNL2KAUC55.9OSTrack
Object TrackingCOESOTPrecision Rate66.6OSTrack
Object TrackingCOESOTSuccess Rate59OSTrack
Object TrackingUAV123AUC0.707OSTrack -384
Object TrackingLaSOTAUC71.1OSTrack-384
Object TrackingLaSOTNormalized Precision81.1OSTrack-384
Object TrackingLaSOTPrecision77.6OSTrack-384
Object TrackingGOT-10kAverage Overlap73.7OSTrack-384
Object TrackingGOT-10kSuccess Rate 0.583.2OSTrack-384
Object TrackingGOT-10kSuccess Rate 0.7570.8OSTrack-384
Object TrackingLaSOT-extAUC50.6OSTrack
Object TrackingLaSOT-extNormalized Precision61.3OSTrack
Object TrackingLaSOT-extPrecision57.6OSTrack
Object TrackingTrackingNetAccuracy83.9OSTrack-384
Object TrackingTrackingNetNormalized Precision88.5OSTrack-384
Object TrackingTrackingNetPrecision83.2OSTrack-384
Object TrackingNT-VOT211AUC38.59OSTrack-384
Object TrackingNT-VOT211Precision53.06OSTrack-384
Visual Object TrackingUAV123AUC0.707OSTrack -384
Visual Object TrackingLaSOTAUC71.1OSTrack-384
Visual Object TrackingLaSOTNormalized Precision81.1OSTrack-384
Visual Object TrackingLaSOTPrecision77.6OSTrack-384
Visual Object TrackingGOT-10kAverage Overlap73.7OSTrack-384
Visual Object TrackingGOT-10kSuccess Rate 0.583.2OSTrack-384
Visual Object TrackingGOT-10kSuccess Rate 0.7570.8OSTrack-384
Visual Object TrackingLaSOT-extAUC50.6OSTrack
Visual Object TrackingLaSOT-extNormalized Precision61.3OSTrack
Visual Object TrackingLaSOT-extPrecision57.6OSTrack
Visual Object TrackingTrackingNetAccuracy83.9OSTrack-384
Visual Object TrackingTrackingNetNormalized Precision88.5OSTrack-384
Visual Object TrackingTrackingNetPrecision83.2OSTrack-384

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-10What You Have is What You Track: Adaptive and Robust Multimodal Tracking2025-07-08Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions2025-07-01Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking2025-06-30Visual and Memory Dual Adapter for Multi-Modal Object Tracking2025-06-30