TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/On Generalization in Coreference Resolution

On Generalization in Coreference Resolution

Shubham Toshniwal, Patrick Xia, Sam Wiseman, Karen Livescu, Kevin Gimpel

2021-09-20CRAC (ACL) 2021 11coreference-resolutionCoreference ResolutionData Augmentation
PaperPDFCodeCode(official)

Abstract

While coreference resolution is defined independently of dataset domain, most models for performing coreference resolution do not transfer well to unseen domains. We consolidate a set of 8 coreference resolution datasets targeting different domains to evaluate the off-the-shelf performance of models. We then mix three datasets for training; even though their domain, annotation guidelines, and metadata differ, we propose a method for jointly training a single model on this heterogeneous data mixture by using data augmentation to account for annotation differences and sampling to balance the data quantities. We find that in a zero-shot setting, models trained on a single dataset transfer poorly while joint training yields improved overall performance, leading to better generalization in coreference resolution models. This work contributes a new benchmark for robust coreference resolution and multiple new state-of-the-art results.

Results

TaskDatasetMetricValueModel
Coreference ResolutionOntoNotesF180.6longdoc S (OntoNotes + 60k pseudo-singletons)
Coreference ResolutionOntoNotesF179.6longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)
Coreference ResolutionOntoNotesF179.2longdoc S (OntoNotes + PreCo + LitBank)
Coreference ResolutionQuizbowlF142.9longdoc S (OntoNotes + PreCo + LitBank)
Coreference ResolutionWinograd Schema ChallengeAccuracy60.1longdoc S (OntoNotes + PreCo + LitBank)
Coreference ResolutionWinograd Schema ChallengeAccuracy59.4longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)
Coreference ResolutionLitBankF178.2longdoc S (OntoNotes + PreCo + LitBank)
Coreference ResolutionWikiCorefF162.5longdoc S (ON + PreCo + LitBank + 30k pseudo-singletons)
Coreference ResolutionWikiCorefF160.3longdoc S (OntoNotes + PreCo + LitBank)
Coreference ResolutionPreCoF187.6longdoc S (OntoNotes + PreCo + LitBank)

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15Iceberg: Enhancing HLS Modeling with Synthetic Data2025-07-14AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs)2025-07-13FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation2025-07-11DS@GT at CheckThat! 2025: Detecting Subjectivity via Transfer-Learning and Corrective Data Augmentation2025-07-08