AUUH_A
Reported on 2 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- f1 macro avg (subtask 2)89best: 89.77 (AUUH_B)
- lev dist (subtask 2)4.08best: 36.38 (AUUH_D)