Metric: Test (higher is better)
| # | Model | Test | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | DeBERTa-large | 90.8 | No | DeBERTa: Decoding-enhanced BERT with Disentangle... | 2020-06-05 | Code |
| 2 | RoBERTa | 89.9 | No | RoBERTa: A Robustly Optimized BERT Pretraining A... | 2019-07-26 | Code |
| 3 | BERT-LARGE | 86.3 | No | BERT: Pre-training of Deep Bidirectional Transfo... | 2018-10-11 | Code |
| 4 | ESIM + ELMo | 59.2 | No | SWAG: A Large-Scale Adversarial Dataset for Grou... | 2018-08-16 | - |
| 5 | ESIM + GloVe | 52.7 | No | SWAG: A Large-Scale Adversarial Dataset for Grou... | 2018-08-16 | - |