Speech Recognition on Fongbe audio

Metric: Word Error Rate (WER) (lower is better)

LeaderboardDataset