LLaMA 13B (zero-shot)
Reported on 6 benchmarks across 3 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing6 results
- Accuracy (High)· 2023-02-2747.2best: 92.6 (ALBERTxxlarge+DUMA(ensemble))
- Accuracy (Middle)· 2023-02-2761.6best: 93.1 (Megatron-BERT (ensemble))
- Accuracy· 2023-02-2750.4best: 83.2 (Unicorn 11B (fine-tuned))
- Accuracy· 2023-02-2756.4best: 78.4 (FLAN 137B (zero-shot))
- Accuracy· 2023-02-2778.1best: 99.87 (Mistral-Nemo 12B (HPT))
- Accuracy· 2023-02-2752.7best: 96.4 (GPT-4 (few-shot, k=25))