Spatio-Temporal Video Grounding on VidSTG

Metric: Declarative vIoU@0.3 (higher is better)

LeaderboardDataset
Loading chart...