Full Ensemble

Reported on 4 benchmarks across 1 task · 1 paper · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing · 4 results

Coreference Resolution on GAP
Bias (F/M) · 2019-06-09
0.98
best: 0.99 (Coref-MTL)
SOTA
Gendered Pronoun Resolution using BERT and an extractive question answering formulation arXiv:1906.03695
Coreference Resolution on GAP
Feminine F1 (F) · 2019-06-09
89.5
best: 92.45 (Coref-MTL)
SOTA
Gendered Pronoun Resolution using BERT and an extractive question answering formulation arXiv:1906.03695
Coreference Resolution on GAP
Masculine F1 (M) · 2019-06-09
90.9
best: 94 (ProBERT)
SOTA
Gendered Pronoun Resolution using BERT and an extractive question answering formulation arXiv:1906.03695
Coreference Resolution on GAP
Overall F1 · 2019-06-09
90.2
best: 92.72 (Coref-MTL)
SOTA
Gendered Pronoun Resolution using BERT and an extractive question answering formulation arXiv:1906.03695
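
The Bias (F/M) figure listed for this model is consistent with the usual GAP convention of dividing feminine F1 by masculine F1, where 1.0 indicates parity. A minimal sketch of that check, using the F1 values listed on this page; the function name `gap_bias` is hypothetical and not taken from the paper or any library:

```python
def gap_bias(feminine_f1: float, masculine_f1: float) -> float:
    """Return the ratio of feminine to masculine F1 (1.0 means parity).

    Assumes the GAP convention Bias = F1(F) / F1(M); this helper is
    illustrative only, not the paper's evaluation code.
    """
    if masculine_f1 == 0:
        raise ValueError("masculine F1 must be non-zero")
    return feminine_f1 / masculine_f1


# F1 values as listed on this page for Full Ensemble.
bias = gap_bias(feminine_f1=89.5, masculine_f1=90.9)
print(round(bias, 2))  # 0.98
```

A value below 1.0 means the model scores lower on feminine pronouns than on masculine ones.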