NLNDE at CANTEMIST: Neural Sequence Labeling and Parsing Approaches for Clinical Concept Extraction

Lukas Lange, Xiang Dai, Heike Adel, Jannik Strötgen

2020-10-23Clinical Concept Extraction

Abstract

The recognition and normalization of clinical information, such as tumor morphology mentions, is an important, but complex process consisting of multiple subtasks. In this paper, we describe our system for the CANTEMIST shared task, which is able to extract, normalize and rank ICD codes from Spanish electronic health records using neural sequence labeling and parsing approaches with context-aware embeddings. Our best system achieves 85.3 F1, 76.7 F1, and 77.0 MAP for the three tasks, respectively.

Related Papers

Selective Attention Federated Learning: Improving Privacy and Efficiency for Clinical Text Classification2025-04-16 BURExtract-Llama: An LLM for Clinical Concept Extraction in Breast Ultrasound Reports2024-08-21 Clinical Concept and Relation Extraction Using Prompt-based Machine Reading Comprehension2023-03-14 Accurate clinical and biomedical Named entity recognition at scale2022-07-19 GatorTron: A Large Clinical Language Model to Unlock Patient Information from Unstructured Electronic Health Records2022-02-02 CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain2021-12-16 Improving Clinical Document Understanding on COVID-19 Research with Spark NLP2020-12-07 CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters2020-10-20