Spatio-Temporal Video Grounding on VidSTG

Metric: Declarative m_vIoU (higher is better)

LeaderboardDataset
Loading chart...