TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DocOIE: A Document-level Context-Aware Dataset for OpenIE

DocOIE: A Document-level Context-Aware Dataset for OpenIE

Kuicai Dong, Yilin Zhao, Aixin Sun, Jung-jae Kim, XiaoLi Li

2021-05-10Findings (ACL) 2021 8Open Information Extraction
PaperPDFCode(official)

Abstract

Open Information Extraction (OpenIE) aims to extract structured relational tuples (subject, relation, object) from sentences and plays critical roles for many downstream NLP applications. Existing solutions perform extraction at sentence level, without referring to any additional contextual information. In reality, however, a sentence typically exists as part of a document rather than standalone; we often need to access relevant contextual information around the sentence before we can accurately interpret it. As there is no document-level context-aware OpenIE dataset available, we manually annotate 800 sentences from 80 documents in two domains (Healthcare and Transportation) to form a DocOIE dataset for evaluation. In addition, we propose DocIE, a novel document-level context-aware OpenIE model. Our experimental results based on DocIE demonstrate that incorporating document-level context is helpful in improving OpenIE performance. Both DocOIE dataset and DocIE model are released for public.

Results

TaskDatasetMetricValueModel
Open Information ExtractionDocOIE-healthcareF160.8DocIE w transformer
Open Information ExtractionDocOIE-healthcareF155.8Reverb
Open Information ExtractionDocOIE-transportationF156.9DocIE w transformer
Open Information ExtractionDocOIE-transportationF149.7Reverb

Related Papers

ChatPD: An LLM-driven Paper-Dataset Networking System2025-05-28Long-context Non-factoid Question Answering in Indic Languages2025-04-18Few-shot Continual Relation Extraction via Open Information Extraction2025-02-23Testing Prompt Engineering Methods for Knowledge Extraction from Text2025-02-18Challenges in Expanding Portuguese Resources: A View from Open Information Extraction2025-01-21Neon: News Entity-Interaction Extraction for Enhanced Question Answering2024-11-19$\textit{BenchIE}^{FL}$ : A Manually Re-Annotated Fact-Based Open Information Extraction Benchmark2024-07-23Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs2024-06-27