TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deeper Task-Specificity Improves Joint Entity and Relation...

Deeper Task-Specificity Improves Joint Entity and Relation Extraction

Phil Crone

2020-02-15Relation Extractionnamed-entity-recognitionNamed Entity RecognitionNERMulti-Task LearningSpecificityJoint Entity and Relation ExtractionNamed Entity Recognition (NER)
PaperPDFCode

Abstract

Multi-task learning (MTL) is an effective method for learning related tasks, but designing MTL models necessitates deciding which and how many parameters should be task-specific, as opposed to shared between tasks. We investigate this issue for the problem of jointly learning named entity recognition (NER) and relation extraction (RE) and propose a novel neural architecture that allows for deeper task-specificity than does prior work. In particular, we introduce additional task-specific bidirectional RNN layers for both the NER and RE tasks and tune the number of shared and task-specific layers separately for different datasets. We achieve state-of-the-art (SOTA) results for both tasks on the ADE dataset; on the CoNLL04 dataset, we achieve SOTA results on the NER task and competitive results on the RE task while using an order of magnitude fewer trainable parameters than the current SOTA architecture. An ablation study confirms the importance of the additional task-specific layers for achieving these results. Our work suggests that previous solutions to joint NER and RE undervalue task-specificity and demonstrates the importance of correctly balancing the number of shared and task-specific parameters for MTL approaches in general.

Results

TaskDatasetMetricValueModel
Relation ExtractionAdverse Drug Events (ADE) CorpusNER Macro F189.48Deeper
Relation ExtractionAdverse Drug Events (ADE) CorpusRE+ Macro F183.74Deeper
Relation ExtractionCoNLL04NER Macro F187Deeper
Relation ExtractionCoNLL04NER Micro F189.78Deeper
Relation ExtractionCoNLL04RE+ Macro F1 72.63Deeper
Relation ExtractionCoNLL04RE+ Micro F171.08Deeper

Related Papers

SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Robust-Multi-Task Gradient Boosting2025-07-15RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features2025-07-11SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation2025-07-10DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations2025-07-08Flippi: End To End GenAI Assistant for E-Commerce2025-07-08Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28PoseMaster: Generating 3D Characters in Arbitrary Poses from a Single Image2025-06-26