Common Sense Reasoning on SWAG

Metric: Dev (higher is better)

LeaderboardDataset
Loading chart...
#ModelDevExtra DataPaperDateCode
1BERT-LARGE86.6NoBERT: Pre-training of Deep Bidirectional Transfo...2018-10-11Code
2ESIM + ELMo59.1NoSWAG: A Large-Scale Adversarial Dataset for Grou...2018-08-16-
3ESIM + GloVe51.9NoSWAG: A Large-Scale Adversarial Dataset for Grou...2018-08-16-