Question Answering on GrailQA

Metric: I.I.D. EM (exact match; higher is better)
