ARES
Reported on 5 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing: 5 results
- SemEval 2007: 71 (best: 80.9, SANDWiCH)
- SemEval 2013: 77.3 (best: 92.6, SANDWiCH)
- SemEval 2015: 83.2 (best: 91.5, SANDWiCH)
- Senseval 2: 78 (best: 87.8, SANDWiCH)
- Senseval 3: 77.1 (best: 85.7, SANDWiCH)