LLaMA-2-13B w/ Selected Demo & Uncertainty
Reported on 2 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 2 results
- F1 · 2023-09-07 · 36.2 · best: 54.8 (MacroIE)
- F1 · 2023-09-07 · 36.9 · best: 72.6 (DeepEx (zero-shot))