DocQA + ELMo

Reported on 2 benchmarks across 1 task · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 2 results

Common Sense Reasoning on ReCoRD
EM · 2018-10-30
45.4
best: 95.9 (Turing NLR v5 XXL 5.4B (fine-tuned))
ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading Comprehension arXiv:1810.12885
Common Sense Reasoning on ReCoRD
F1 · 2018-10-30
46.7
best: 96.4 (Turing NLR v5 XXL 5.4B (fine-tuned))
ReCoRD: Bridging the Gap between Human and Machine Commonsense Reading Comprehension arXiv:1810.12885