Vanilla DrQA (single model)
Reported on 3 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing3 results
- In-domain· 2018-08-2154.5best: 82.5 (BERT Large Augmented (single model))
- Out-of-domain· 2018-08-2147.9best: 77.6 (BERT Large Augmented (single model))
- Overall· 2018-08-2152.6best: 85 (GPT-3 175B (few-shot, k=32))