LLaMA 7B (zero-shot)
Reported on 6 benchmarks across 3 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing (6 results)
- Accuracy (High) · 2023-02-27 · 46.9 · best: 92.6 (ALBERTxxlarge+DUMA(ensemble))
- Accuracy (Middle) · 2023-02-27 · 61.1 · best: 93.1 (Megatron-BERT (ensemble))
- Accuracy · 2023-02-27 · 48.9 · best: 83.2 (Unicorn 11B (fine-tuned))
- Accuracy · 2023-02-27 · 57.2 · best: 78.4 (FLAN 137B (zero-shot))
- Accuracy · 2023-02-27 · 76.5 · best: 99.87 (Mistral-Nemo 12B (HPT))
- Accuracy · 2023-02-27 · 47.6 · best: 96.4 (GPT-4 (few-shot, k=25))