Reading Comprehension on AdversarialQA

Metric: D(RoBERTa): F1 (higher is better)
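
For readers unfamiliar with the metric, F1 in extractive question answering is commonly computed as token-overlap F1 between the predicted and reference answer spans. The following is a minimal sketch under that assumption, not the benchmark's official scorer: the function name `token_f1` is hypothetical, tokenization is plain whitespace splitting, and it omits the punctuation and article normalization and the maximum over multiple reference answers that SQuAD-style evaluation scripts typically apply. It returns a value in [0, 1], whereas leaderboard scores appear to be on a 0-100 scale.

```python
from collections import Counter


def token_f1(prediction: str, reference: str) -> float:
    """Token-overlap F1 between a predicted and a reference answer (simplified)."""
    # Assumption: lowercase + whitespace tokenization; official scripts normalize more.
    pred_tokens = prediction.lower().split()
    ref_tokens = reference.lower().split()

    # If either answer is empty, score 1.0 only when both are empty.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)

    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0

    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

A dataset-level score under this sketch would be the mean of `token_f1` over all questions, multiplied by 100.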

# | Model         | D(RoBERTa): F1 | Extra Data | Paper                                               | Date       | Code
1 | BERT-Large    | 54.4           | Yes        | Beat the AI: Investigating Adversarial Human Ann... | 2020-02-02 | Code
2 | RoBERTa-Large | 53.4           | Yes        | Beat the AI: Investigating Adversarial Human Ann... | 2020-02-02 | Code
3 | BiDAF         | 26.7           | Yes        | Beat the AI: Investigating Adversarial Human Ann... | 2020-02-02 | Code