Speech Recognition on VIVOS

Metric: Test WER (word error rate on the test set; lower is better)
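
For illustration, WER is conventionally defined as (substitutions + deletions + insertions) divided by the number of words in the reference transcript. The sketch below computes it with a word-level edit distance. It assumes plain whitespace tokenization and no text normalization, which may differ from how entries on this leaderboard were scored; `word_error_rate` is a hypothetical helper name, not part of any benchmark tooling.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: words are separated by whitespace and compared verbatim
    (no lowercasing or punctuation stripping).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution and one deletion against a four-word reference.
    print(word_error_rate("a b c d", "a x c"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.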
