BERT for Coreference Resolution: Baselines and Analysis

Mandar Joshi, Omer Levy, Daniel S. Weld, Luke Zettlemoyer

2019-08-24 | IJCNLP 2019 11 | Coreference Resolution

Abstract

We apply BERT to coreference resolution, achieving strong improvements on the OntoNotes (+3.9 F1) and GAP (+11.5 F1) benchmarks. A qualitative analysis of model predictions indicates that, compared to ELMo and BERT-base, BERT-large is particularly better at distinguishing between related but distinct entities (e.g., President and CEO). However, there is still room for improvement in modeling document-level context, conversations, and mention paraphrasing. Our code and models are publicly available.

Results

Task	Dataset	Metric	Value	Model
Coreference Resolution	OntoNotes	F1	76.9	BERT-large
Coreference Resolution	OntoNotes	F1	73.9	BERT-base
Coreference Resolution	CoNLL 2012	Avg F1	76.9	c2f-coref + BERT-large
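The "Avg F1" metric reported for CoNLL 2012 is conventionally the unweighted mean of the F1 scores from three coreference metrics: MUC, B-cubed, and CEAF-phi4. The sketch below shows only that arithmetic. It is not the official scorer and is not taken from the authors' code. The function names and the per-metric values are hypothetical, chosen for illustration.

```python
from statistics import mean


def f1(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def conll_avg_f1(muc_f1: float, b_cubed_f1: float, ceaf_phi4_f1: float) -> float:
    """Unweighted mean of the three per-metric F1 scores (the CoNLL-2012 average)."""
    return mean([muc_f1, b_cubed_f1, ceaf_phi4_f1])


# Hypothetical per-metric F1 scores, NOT results from the paper.
example_avg = conll_avg_f1(muc_f1=80.0, b_cubed_f1=70.0, ceaf_phi4_f1=66.0)
print(example_avg)  # 72.0
```

Because the three metrics are averaged with equal weight, a gain on a single metric moves the reported average by one third of that gain.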
