Hybrid H3 355M (3-shot, logit scoring)
Reported on 1 benchmark across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing1 result
- EM· 2022-12-2859.7best: 69.2 (PaLM 540B (finetuned) )