Revisiting RGBT Tracking Benchmarks from the Perspective of Modality Validity: A New Benchmark, Problem, and Method

Zhangyong Tang, Tianyang Xu, ZhenHua Feng, XueFeng Zhu, He Wang, Pengcheng Shao, Chunyang Cheng, Xiao-Jun Wu, Muhammad Awais, Sara Atito, Josef Kittler

2024-04-30Rgb-T Tracking

Paper PDF Code(official)

Abstract

RGBT tracking draws increasing attention due to its robustness in multi-modality warranting (MMW) scenarios, such as nighttime and bad weather, where relying on a single sensing modality fails to ensure stable tracking results. However, the existing benchmarks predominantly consist of videos collected in common scenarios where both RGB and thermal infrared (TIR) information are of sufficient quality. This makes the data unrepresentative of severe imaging conditions, leading to tracking failures in MMW scenarios. To bridge this gap, we present a new benchmark, MV-RGBT, captured specifically in MMW scenarios. In contrast with the existing datasets, MV-RGBT comprises more object categories and scenes, providing a diverse and challenging benchmark. Furthermore, for severe imaging conditions of MMW scenarios, a new problem is posed, namely \textit{when to fuse}, to stimulate the development of fusion strategies for such data. We propose a new method based on a mixture of experts, namely MoETrack, as a baseline fusion strategy. In MoETrack, each expert generates independent tracking results along with the corresponding confidence score, which is used to control the fusion process. Extensive experimental results demonstrate the significant potential of MV-RGBT in advancing RGBT tracking and elicit the conclusion that fusion is not always beneficial, especially in MMW scenarios. Significantly, the proposed MoETrack method achieves new state-of-the-art results not only on MV-RGBT, but also on standard benchmarks, such as RGBT234, LasHeR, and the short-term split of VTUAV (VTUAV-ST). More information of MV-RGBT and the source code of MoETrack will be released at https://github.com/Zhangyong-Tang/MoETrack.

Results

Task	Dataset	Metric	Value	Model
Visual Tracking	LasHeR	Precision	72.1	MoETrack
Visual Tracking	LasHeR	Success	57.8	MoETrack
Visual Tracking	GTOT	Precision	93.6	MoETrack
Visual Tracking	GTOT	Success	78.4	MoETrack
Visual Tracking	RGBT234	Precision	88.1	MoETrack
Visual Tracking	RGBT234	Success	65.1	MoETrack

Abstract

Task

Dataset

Metric

Value

Model

Visual Tracking

LasHeR

Precision

72.1

MoETrack

Visual Tracking

LasHeR

Success

57.8

MoETrack

Visual Tracking

GTOT

Precision

93.6

MoETrack

Visual Tracking

GTOT

Success

78.4

MoETrack

Visual Tracking

RGBT234

Precision

88.1

MoETrack

Visual Tracking

RGBT234

Success

65.1

MoETrack

Revisiting RGBT Tracking Benchmarks from the Perspective of Modality Validity: A New Benchmark, Problem, and Method

Abstract

Results

Related Papers

Revisiting RGBT Tracking Benchmarks from the Perspective of Modality Validity: A New Benchmark, Problem, and Method

Abstract

Results

Related Papers