TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/REXEL: An End-to-end Model for Document-Level Relation Ext...

REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Nacime Bouziani, Shubhi Tyagi, Joseph Fisher, Jens Lehmann, Andrea Pierleoni

2024-04-19Relation ExtractionBenchmarkingcoreference-resolutionCoreference ResolutionEntity LinkingEntity RetrievalDocument-level Closed Information ExtractionDocument-level Relation ExtractionJoint Entity and Relation ExtractionRelation ClassificationNamed Entity Recognition (NER)Entity DisambiguationEntity Typing
PaperPDFCode(official)

Abstract

Extracting structured information from unstructured text is critical for many downstream NLP applications and is traditionally achieved by closed information extraction (cIE). However, existing approaches for cIE suffer from two limitations: (i) they are often pipelines which makes them prone to error propagation, and/or (ii) they are restricted to sentence level which prevents them from capturing long-range dependencies and results in expensive inference time. We address these limitations by proposing REXEL, a highly efficient and accurate model for the joint task of document level cIE (DocIE). REXEL performs mention detection, entity typing, entity disambiguation, coreference resolution and document-level relation classification in a single forward pass to yield facts fully linked to a reference knowledge graph. It is on average 11 times faster than competitive existing approaches in a similar setting and performs competitively both when optimised for any of the individual subtasks and a variety of combinations of different joint tasks, surpassing the baselines by an average of more than 6 F1 points. The combination of speed and accuracy makes REXEL an accurate cost-efficient system for extracting structured information at web-scale. We also release an extension of the DocRED dataset to enable benchmarking of future work on DocIE, which is available at https://github.com/amazon-science/e2e-docie.

Results

TaskDatasetMetricValueModel
Relation ExtractionDWIEF1-Hard65.8REXEL
Relation ExtractionDocRED-IERelation F160.1REXEL
Relation ExtractionDocRED-IERelation F139.06REXEL
Relation ExtractionDocREDRelation F139.06REXEL
Information ExtractionDocRED-IERelation F139.06REXEL
Information ExtractionDocREDRelation F139.06REXEL
Information ExtractionDocRED-IERelation F127.96REXEL
Information ExtractionDWIEF1-Hard53.77REXEL
Information ExtractionDocREDRelation F127.96REXEL
Named Entity Recognition (NER)DWIEF1-Hard90.59REXEL
Coreference ResolutionDWIEAvg. F195.12REXEL
Coreference ResolutionDocRED-IEAvg F190.93REXEL
Entity TypingDocRED-IEAvg F196.01REXEL
Entity DisambiguationDocRED-IEAvg F186.74REXEL
Document-level Relation ExtractionDocRED-IERelation F160.1REXEL

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion2025-07-15FLsim: A Modular and Library-Agnostic Simulation Framework for Federated Learning2025-07-15