LLaMA-2-70B w/ Selected Demo & Uncertainty
Reported on 2 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 2 results
- F1 · 2023-09-07 · 51.5 · best: 54.8 (MacroIE)
- F1 · 2023-09-07 · 65.8 · best: 72.6 (DeepEx (zero-shot))