BiDAF++ (single model)
Reported on 7 benchmarks across 1 task · 1 paper · 3 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing7 results
- In-domain· 2018-09-27SOTA69.4best: 82.5 (BERT Large Augmented (single model))
- Out-of-domain· 2018-09-27SOTA63.8best: 77.6 (BERT Large Augmented (single model))
- Overall· 2018-09-27SOTA67.8best: 85 (GPT-3 175B (few-shot, k=32))
- 77.573best: 90.622 ({ANNA} (single model))
- 84.858best: 95.719 ({ANNA} (single model))
- 65.651best: 90.939 (IE-Net (ensemble))
- 68.866best: 93.214 (IE-Net (ensemble))