Question Answering on MuLD (NarrativeQA)

Metric: Rouge-L (higher is better)

LeaderboardDataset
Loading chart...