Majority-voting ensemble on best 7 models
Reported on 4 benchmarks across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 4 results
- F0.5 · 2024-04-23 · SOTA · 81.4
- F0.5 · 2024-04-23 · 71.8 · best: 72.8 (Ensembles of best 7 models + GRECO + GTP-rerank)
- Precision · 2024-04-23 · 83.7 · best: 83.9 (Ensembles of best 7 models + GRECO + GTP-rerank)
- Recall · 2024-04-23 · 45.7 · best: 53.8 (Unsupervised GEC + cLang8)
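
For readers unfamiliar with the metric, F0.5 is the weighted harmonic mean of precision and recall with beta = 0.5, so precision counts more heavily than recall. The sketch below is only an illustration: the function name `f_beta` is hypothetical, and it assumes the Precision, Recall, and second F0.5 entries above come from the same evaluation, which this page does not state.

```python
def f_beta(precision: float, recall: float, beta: float = 0.5) -> float:
    """Return the F-beta score; beta < 1 weights precision above recall."""
    if precision == 0 and recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as listed above, in percent.
score = f_beta(83.7, 45.7)
print(round(score, 1))  # 71.8
```

Under that assumption, the listed precision and recall reproduce the 71.8 F0.5 entry, which suggests the three figures belong together.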