A Dataset for N-ary Relation Extraction of Drug Combinations

Aryeh Tiktinsky, Vijay Viswanathan, Danna Niezni, Dana Meron Azagury, Yosi Shamay, Hillel Taub-Tabib, Tom Hope, Yoav Goldberg

2022-05-04NAACL 2022 7Relation Extraction Drug–drug Interaction Extraction

Paper PDF Code(official)Code

Abstract

Combination therapies have become the standard of care for diseases such as cancer, tuberculosis, malaria and HIV. However, the combinatorial set of available multi-drug treatments creates a challenge in identifying effective combination therapies available in a situation. To assist medical professionals in identifying beneficial drug-combinations, we construct an expert-annotated dataset for extracting information about the efficacy of drug combinations from the scientific literature. Beyond its practical utility, the dataset also presents a unique NLP challenge, as the first relation extraction dataset consisting of variable-length relations. Furthermore, the relations in this dataset predominantly require language understanding beyond the sentence level, adding to the challenge of this task. We provide a promising baseline model and identify clear areas for further improvement. We release our dataset, code, and baseline models publicly to encourage the NLP community to participate in this task.

Results

Task	Dataset	Metric	Value	Model
Information Extraction	Drug Combination Extraction Dataset	Exact Match F1 ("Any Combination")	69.4	PubmedBERT + PURE (domain-adapted)
Information Extraction	Drug Combination Extraction Dataset	Exact Match F1 ("Positive Combination")	61.8	PubmedBERT + PURE (domain-adapted)

Related Papers

DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations2025-07-08 Multiple Streams of Relation Extraction: Enriching and Recalling in Transformers2025-06-25 Chaining Event Spans for Temporal Relation Grounding2025-06-17 Summarization for Generative Relation Extraction in the Microbiome Domain2025-06-10 Conservative Bias in Large Language Models: Measuring Relation Predictions2025-06-09 Comparative Analysis of AI Agent Architectures for Entity Relationship Classification2025-06-03 CREFT: Sequential Multi-Agent LLM for Character Relation Extraction2025-05-30 Generating Diverse Training Samples for Relation Extraction with Large Language Models2025-05-29