ESR base
Reported on 4 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing4 results
- F1 (Fewshot Test)77.8best: 90.7 (GlossGPT)
- F1 (Zero shot test)71.6best: 79.5 (GlossGPT)
- F1 (Zeroshot Dev)73.9best: 81.8 (GlossGPT)
- F1(FewShot Dev)77.9best: 90.2 (GlossGPT)