On Generalization in Coreference Resolution

Shubham Toshniwal, Patrick Xia, Sam Wiseman, Karen Livescu, Kevin Gimpel

2021-09-20, CRAC (ACL) 2021. Tasks: Coreference Resolution, Data Augmentation

Abstract

While coreference resolution is defined independently of dataset domain, most models for performing coreference resolution do not transfer well to unseen domains. We consolidate a set of 8 coreference resolution datasets targeting different domains to evaluate the off-the-shelf performance of models. We then mix three datasets for training; even though their domain, annotation guidelines, and metadata differ, we propose a method for jointly training a single model on this heterogeneous data mixture by using data augmentation to account for annotation differences and sampling to balance the data quantities. We find that in a zero-shot setting, models trained on a single dataset transfer poorly while joint training yields improved overall performance, leading to better generalization in coreference resolution models. This work contributes a new benchmark for robust coreference resolution and multiple new state-of-the-art results.
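The abstract mentions sampling to balance the data quantities across the training datasets. As a minimal sketch of that idea, and not the authors' implementation, one simple option is to cap the number of documents drawn from each dataset per epoch so that the largest corpus cannot dominate the mixture. The function name `sample_epoch`, the cap value, and the toy corpus sizes below are all hypothetical and chosen only for illustration.

```python
import random


def sample_epoch(datasets, cap, seed=0):
    """Draw at most `cap` documents from each dataset, then shuffle the mix.

    `datasets` maps a dataset name to a list of documents. The cap is an
    assumed balancing knob, not a value taken from the paper.
    """
    rng = random.Random(seed)
    mixed = []
    for name, docs in datasets.items():
        k = min(cap, len(docs))
        # Sample without replacement; small datasets are used in full.
        mixed.extend((name, doc) for doc in rng.sample(docs, k))
    # Interleave documents from different datasets within the epoch.
    rng.shuffle(mixed)
    return mixed


# Toy corpora with made-up sizes; strings stand in for real documents.
corpora = {
    "ontonotes": [f"on_{i}" for i in range(2800)],
    "preco": [f"pc_{i}" for i in range(36000)],
    "litbank": [f"lb_{i}" for i in range(80)],
}
epoch = sample_epoch(corpora, cap=1000)
```

Calling `sample_epoch` with a different `seed` each epoch would expose the model to different subsets of the larger corpora over time while keeping each epoch's mixture bounded.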

Results

Task	Dataset	Metric	Value	Model
Coreference Resolution	OntoNotes	F1	80.6	longdoc S (OntoNotes + 60k pseudo-singletons)
Coreference Resolution	OntoNotes	F1	79.6	longdoc S (OntoNotes + PreCo + LitBank + 30k pseudo-singletons)
Coreference Resolution	OntoNotes	F1	79.2	longdoc S (OntoNotes + PreCo + LitBank)
Coreference Resolution	Quizbowl	F1	42.9	longdoc S (OntoNotes + PreCo + LitBank)
Coreference Resolution	Winograd Schema Challenge	Accuracy	60.1	longdoc S (OntoNotes + PreCo + LitBank)
Coreference Resolution	Winograd Schema Challenge	Accuracy	59.4	longdoc S (OntoNotes + PreCo + LitBank + 30k pseudo-singletons)
Coreference Resolution	LitBank	F1	78.2	longdoc S (OntoNotes + PreCo + LitBank)
Coreference Resolution	WikiCoref	F1	62.5	longdoc S (OntoNotes + PreCo + LitBank + 30k pseudo-singletons)
Coreference Resolution	WikiCoref	F1	60.3	longdoc S (OntoNotes + PreCo + LitBank)
Coreference Resolution	PreCo	F1	87.6	longdoc S (OntoNotes + PreCo + LitBank)
