Question Answering on MuLD (NarrativeQA)

Metric: BLEU-4 (higher is better)

LeaderboardDataset
Loading chart...