
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data

Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoit Crabbé, Etienne Bernard

2024-02-23
Few-shot NER | named-entity-recognition | Named Entity Recognition | NER | Named Entity Recognition (NER) | Zero-shot Named Entity Recognition (NER)

Abstract

Large Language Models (LLMs) have shown impressive abilities in data annotation, opening the way for new approaches to solve classic NLP problems. In this paper, we show how to use LLMs to create NuNER, a compact language representation model specialized in the Named Entity Recognition (NER) task. NuNER can be fine-tuned to solve downstream NER problems in a data-efficient way, outperforming similar-sized foundation models in the few-shot regime and competing with much larger LLMs. We find that the size and entity-type diversity of the pre-training dataset are key to achieving good performance. We view NuNER as a member of the broader family of task-specific foundation models, recently unlocked by LLMs.

Results

Task | Dataset | Metric | Value | Model
Named Entity Recognition (NER) | NCBI-disease | F1 | 61.1 | NuNER Zero Span
Named Entity Recognition (NER) | Ontonotes v5 (English) | F1 | 89.1 | NuNER
Named Entity Recognition (NER) | Ontonotes v5 (English) | Precision | 87.8 | NuNER
Named Entity Recognition (NER) | Ontonotes v5 (English) | Recall | 90.5 | NuNER
Named Entity Recognition (NER) | Few-NERD (SUP) | F1-Measure | 69.4 | NuNER
Named Entity Recognition (NER) | Few-NERD (SUP) | Precision | 67.8 | NuNER
Named Entity Recognition (NER) | Few-NERD (SUP) | Recall | 71.1 | NuNER
Named Entity Recognition (NER) | CrossNER | AI | 61.7 | NuNER Zero Span
Named Entity Recognition (NER) | CrossNER | Literature | 64.9 | NuNER Zero Span
Named Entity Recognition (NER) | CrossNER | Music | 69.9 | NuNER Zero Span
Named Entity Recognition (NER) | CrossNER | Politics | 71.7 | NuNER Zero Span
Named Entity Recognition (NER) | CrossNER | Science | 65.4 | NuNER Zero Span
Named Entity Recognition (NER) | HarveyNER | Entity F1 | 24.9 | NuNER Zero Span
Named Entity Recognition (NER) | Broad Twitter Corpus | Entity F1 | 60.2 | NuNER Zero Span
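
The Precision, Recall, and F1 values listed above for Ontonotes v5 (English) and Few-NERD (SUP) can be cross-checked with a short sketch. It assumes the reported F1 is the harmonic mean of the listed precision and recall under the same averaging scheme, which this page does not state. The helper name `f1_score` is chosen here for illustration only and is not taken from the paper or its code.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall, on the same scale as the inputs."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the results listed above.
reported = {
    "Ontonotes v5 (English)": (87.8, 90.5, 89.1),
    "Few-NERD (SUP)": (67.8, 71.1, 69.4),
}

for dataset, (precision, recall, reported_f1) in reported.items():
    computed = f1_score(precision, recall)
    # Assumption: agreement within rounding to one decimal place is close enough.
    print(f"{dataset}: computed F1 = {computed:.1f}, reported F1 = {reported_f1}")
```

Under that assumption, both computed values agree with the reported F1 to one decimal place.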

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce (2025-07-08)
Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models (2025-06-28)
Improving Named Entity Transcription with Contextual LLM-based Revision (2025-06-12)
Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering (2025-06-05)
Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective (2025-06-05)
Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering (2025-06-04)
EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models (2025-05-29)
Label-Guided In-Context Learning for Named Entity Recognition (2025-05-29)