Full Ensemble
Reported on 4 benchmarks across 1 task · 1 paper · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing4 results
- Bias (F/M)· 2019-06-09SOTA0.98best: 0.99 (Coref-MTL)
- Feminine F1 (F)· 2019-06-09SOTA89.5best: 92.45 (Coref-MTL)
- Masculine F1 (M)· 2019-06-09SOTA90.9best: 94 (ProBERT)
- Overall F1· 2019-06-09SOTA90.2best: 92.72 (Coref-MTL)