NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne Bernard

2024-02-23Few-shot NER named-entity-recognition Named Entity Recognition NER Named Entity Recognition (NER)Zero-shot Named Entity Recognition (NER)

Paper PDF Code(official)

Abstract

Large Language Models (LLMs) have shown impressive abilities in data annotation, opening the way for new approaches to solve classic NLP problems. In this paper, we show how to use LLMs to create NuNER, a compact language representation model specialized in the Named Entity Recognition (NER) task. NuNER can be fine-tuned to solve downstream NER problems in a data-efficient way, outperforming similar-sized foundation models in the few-shot regime and competing with much larger LLMs. We find that the size and entity-type diversity of the pre-training dataset are key to achieving good performance. We view NuNER as a member of the broader family of task-specific foundation models, recently unlocked by LLMs.

Results

Task	Dataset	Metric	Value	Model
Named Entity Recognition (NER)	NCBI-disease	F1	61.1	NuNER Zero Span
Named Entity Recognition (NER)	Ontonotes v5 (English)	F1	89.1	NuNER
Named Entity Recognition (NER)	Ontonotes v5 (English)	Precision	87.8	NuNER
Named Entity Recognition (NER)	Ontonotes v5 (English)	Recall	90.5	NuNER
Named Entity Recognition (NER)	Few-NERD (SUP)	F1-Measure	69.4	NuNER
Named Entity Recognition (NER)	Few-NERD (SUP)	Precision	67.8	NuNER
Named Entity Recognition (NER)	Few-NERD (SUP)	Recall	71.1	NuNER
Named Entity Recognition (NER)	CrossNER	AI	61.7	NuNERZero span
Named Entity Recognition (NER)	CrossNER	Literature	64.9	NuNERZero span
Named Entity Recognition (NER)	CrossNER	Music	69.9	NuNERZero span
Named Entity Recognition (NER)	CrossNER	Politics	71.7	NuNERZero span
Named Entity Recognition (NER)	CrossNER	Science	65.4	NuNERZero span
Named Entity Recognition (NER)	HarveyNER	Entity F1	24.9	NuNER Zero Span
Named Entity Recognition (NER)	Broad Twitter Corpus	Entity F1	60.2	NuNerZero Span

NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Abstract

Results

Related Papers

NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Abstract

Results

Related Papers