BERT-LARGE (Ensemble+TriviaQA)
Reported on 4 benchmarks across 1 task · 1 paper · 3 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing4 results
- EM· 2018-10-11SOTA86.2best: 90.06 (T5-11B)
- F1· 2018-10-11SOTA92.2best: 95.77 (XLNet+DSC)
- F1· 2018-10-11SOTA93.2best: 95.719 ({ANNA} (single model))
- EM· 2018-10-1187.4best: 90.622 ({ANNA} (single model))