Stefan Schweter, Alan Akbik
Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features, such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context, and report new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.
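The core idea of document-level features is to surround each sentence with tokens from its neighboring sentences in the same document before passing it to the transformer. The following is a minimal, framework-free sketch of such context construction; the helper name `build_context` and the default window of 64 tokens are illustrative choices for this example, not the paper's exact implementation (see the Flair framework for the actual integration).

```python
def build_context(sentences, index, window=64):
    """Collect up to `window` context tokens on each side of sentences[index].

    `sentences` is a list of token lists belonging to one document, so the
    context never crosses document boundaries (document-locality).
    Returns (left_context, right_context) as token lists.
    """
    # Walk backwards through preceding sentences until enough tokens are gathered.
    left = []
    for sent in reversed(sentences[:index]):
        left = sent + left
        if len(left) >= window:
            break
    # Walk forwards through following sentences the same way.
    right = []
    for sent in sentences[index + 1:]:
        right = right + sent
        if len(right) >= window:
            break
    # Truncate to the window size, keeping the tokens nearest to the sentence.
    return left[-window:], right[:window]
```

The left and right context tokens are concatenated around the input sentence for the transformer forward pass, but their outputs are discarded: only the embeddings of the sentence's own tokens are used for tagging.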
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Named Entity Recognition (NER) | CoNLL 2003 (German) | F1 | 88.34 | FLERT XLM-R |
| Named Entity Recognition (NER) | CoNLL 2003 (English) | F1 | 94.09 | FLERT XLM-R |
| Named Entity Recognition (NER) | FindVehicle | F1 | 80.9 | FLERT |
| Named Entity Recognition (NER) | CoNLL 2002 (Spanish) | F1 | 90.14 | FLERT XLM-R |
| Named Entity Recognition (NER) | CoNLL 2002 (Dutch) | F1 | 95.21 | FLERT XLM-R |
| Named Entity Recognition (NER) | CoNLL 2003 (German) Revised | F1 | 92.23 | FLERT XLM-R |