REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Nacime Bouziani, Shubhi Tyagi, Joseph Fisher, Jens Lehmann, Andrea Pierleoni

2024-04-19Relation Extraction Benchmarking coreference-resolution Coreference Resolution Entity Linking Entity Retrieval Document-level Closed Information Extraction Document-level Relation Extraction Joint Entity and Relation Extraction Relation Classification Named Entity Recognition (NER)Entity Disambiguation Entity Typing

Paper PDF Code(official)

Abstract

Extracting structured information from unstructured text is critical for many downstream NLP applications and is traditionally achieved by closed information extraction (cIE). However, existing approaches for cIE suffer from two limitations: (i) they are often pipelines which makes them prone to error propagation, and/or (ii) they are restricted to sentence level which prevents them from capturing long-range dependencies and results in expensive inference time. We address these limitations by proposing REXEL, a highly efficient and accurate model for the joint task of document level cIE (DocIE). REXEL performs mention detection, entity typing, entity disambiguation, coreference resolution and document-level relation classification in a single forward pass to yield facts fully linked to a reference knowledge graph. It is on average 11 times faster than competitive existing approaches in a similar setting and performs competitively both when optimised for any of the individual subtasks and a variety of combinations of different joint tasks, surpassing the baselines by an average of more than 6 F1 points. The combination of speed and accuracy makes REXEL an accurate cost-efficient system for extracting structured information at web-scale. We also release an extension of the DocRED dataset to enable benchmarking of future work on DocIE, which is available at https://github.com/amazon-science/e2e-docie.

Results

Task	Dataset	Metric	Value	Model
Relation Extraction	DWIE	F1-Hard	65.8	REXEL
Relation Extraction	DocRED-IE	Relation F1	60.1	REXEL
Relation Extraction	DocRED-IE	Relation F1	39.06	REXEL
Relation Extraction	DocRED	Relation F1	39.06	REXEL
Information Extraction	DocRED-IE	Relation F1	39.06	REXEL
Information Extraction	DocRED	Relation F1	39.06	REXEL
Information Extraction	DocRED-IE	Relation F1	27.96	REXEL
Information Extraction	DWIE	F1-Hard	53.77	REXEL
Information Extraction	DocRED	Relation F1	27.96	REXEL
Named Entity Recognition (NER)	DWIE	F1-Hard	90.59	REXEL
Coreference Resolution	DWIE	Avg. F1	95.12	REXEL
Coreference Resolution	DocRED-IE	Avg F1	90.93	REXEL
Entity Typing	DocRED-IE	Avg F1	96.01	REXEL
Entity Disambiguation	DocRED-IE	Avg F1	86.74	REXEL
Document-level Relation Extraction	DocRED-IE	Relation F1	60.1	REXEL

REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Abstract

Results

Related Papers

REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Abstract

Results

Related Papers