FlowQA (single model)
Reported on 5 benchmarks across 1 task · 1 paper · 5 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing5 results
- Out-of-domain· 2018-10-06SOTA71.8best: 77.6 (BERT Large Augmented (single model))
- Overall· 2018-10-06SOTA75best: 85 (GPT-3 175B (few-shot, k=32))
- F1· 2018-10-06SOTA64.1
- HEQD· 2018-10-06SOTA5.8
- HEQQ· 2018-10-06SOTA59.6