FiD+Distil
Reported on 5 benchmarks across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing5 results
- EM· uses extra data· 2020-12-08SOTA72.1best: 87.5 (Claude 2 (few-shot, k=5))
- BLEU-1· 2020-12-0835.3best: 54.11 (Masque (NarrativeQA + MS MARCO))
- BLEU-4· 2020-12-087.5best: 30.43 (Masque (NarrativeQA + MS MARCO))
- METEOR· 2020-12-0811.1best: 26.13 (Masque (NarrativeQA + MS MARCO))
- Rouge-L· 2020-12-0832best: 59.87 (Masque (NarrativeQA + MS MARCO))