Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Cross Fusion RGB-T Tracking with Bi-directional Adapter

Zhirong Zeng, Xiaotao Liu, Meng Sun, Hongyu Wang, Jing Liu

2024-08-30 · RGB-T Tracking
Paper · PDF

Abstract

Many state-of-the-art RGB-T trackers have achieved remarkable results through modality fusion. However, these trackers often either overlook temporal information or fail to fully utilize it, resulting in an ineffective balance between multi-modal and temporal information. To address this issue, we propose a novel Cross Fusion RGB-T Tracking architecture (CFBT) that ensures the full participation of multiple modalities in tracking while dynamically fusing temporal information. The effectiveness of CFBT relies on three newly designed cross spatio-temporal information fusion modules: Cross Spatio-Temporal Augmentation Fusion (CSTAF), Cross Spatio-Temporal Complementarity Fusion (CSTCF), and Dual-Stream Spatio-Temporal Adapter (DSTA). CSTAF employs a cross-attention mechanism to comprehensively enhance the feature representation of the template. CSTCF exploits complementary information between different branches to strengthen target features and suppress background features. DSTA adopts the adapter concept to adaptively fuse complementary information from multiple branches within the transformer layers, using the RGB modality as a medium. Together, these multi-perspective fusions add less than 0.3% of the total model parameters, yet they achieve an efficient balance between multi-modal and temporal information. Extensive experiments on three popular RGB-T tracking benchmarks demonstrate that our method achieves new state-of-the-art performance.
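The two core ingredients the abstract names — cross-attention between modality branches and a lightweight adapter inserted into transformer layers — can be illustrated with a minimal NumPy sketch. This is not the authors' CFBT implementation; the shapes, names (`cross_attention`, `LowRankAdapter`), and the rank/dimension choices are assumptions made for illustration only. It shows how one branch's tokens attend to the other branch's tokens, and why a bottleneck adapter contributes only a small fraction of the parameters of a full backbone layer.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(q_feats, kv_feats):
    # Queries come from one branch, keys/values from the other,
    # so each token in q_feats aggregates information from kv_feats.
    d = q_feats.shape[-1]
    attn = softmax(q_feats @ kv_feats.T / np.sqrt(d))
    return attn @ kv_feats

class LowRankAdapter:
    # Bottleneck adapter: down-project -> ReLU -> up-project, residual add.
    # The up-projection is zero-initialized, so the adapter starts as identity.
    def __init__(self, dim, rank, rng):
        self.down = rng.standard_normal((dim, rank)) * 0.02
        self.up = np.zeros((rank, dim))

    def __call__(self, x):
        return x + np.maximum(x @ self.down, 0.0) @ self.up

rng = np.random.default_rng(0)
dim, rank, n_tokens = 256, 8, 64          # illustrative sizes, not the paper's
rgb = rng.standard_normal((n_tokens, dim))  # RGB-branch tokens
tir = rng.standard_normal((n_tokens, dim))  # thermal-branch tokens

# RGB tokens attend to thermal tokens, then pass through the adapter.
adapter = LowRankAdapter(dim, rank, rng)
fused = adapter(rgb + cross_attention(rgb, tir))

# Adapter parameters vs. one dense dim x dim backbone layer: a small fraction.
adapter_params = dim * rank * 2
backbone_params = dim * dim
print(fused.shape, adapter_params / backbone_params)  # (64, 256) 0.0625
```

The zero-initialized up-projection is a common adapter trick: at the start of training the fused branch behaves exactly like the frozen backbone, and the adapter learns only the residual correction.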

Results

Task             Dataset   Metric     Value  Model
Visual Tracking  LasHeR    Precision  73.2   CFBT
Visual Tracking  LasHeR    Success    58.4   CFBT
Visual Tracking  RGBT234   Precision  89.9   CFBT
Visual Tracking  RGBT234   Success    65.9   CFBT
Visual Tracking  RGBT210   Precision  87.7   CFBT
Visual Tracking  RGBT210   Success    63.0   CFBT

Related Papers

Lightweight RGB-T Tracking with Mobile Vision Transformers (2025-06-23)
Modality-Guided Dynamic Graph Fusion and Temporal Diffusion for Self-Supervised RGB-T Tracking (2025-05-06)
Breaking Shallow Limits: Task-Driven Pixel Fusion for Gap-free RGBT Tracking (2025-03-14)
Adaptive Perception for Unified Visual Multi-modal Object Tracking (2025-02-10)
BTMTrack: Robust RGB-T Tracking via Dual-template Bridging and Temporal-Modal Candidate Elimination (2025-01-07)
PURA: Parameter Update-Recovery Test-Time Adaption for RGB-T Tracking (2025-01-01)
SUTrack: Towards Simple and Unified Single Object Tracking (2024-12-26)
Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking (2024-12-20)