Boundary Smoothing for Named Entity Recognition

Enwei Zhu, Jinpeng Li

2022-04-26ACL 2022 5Nested Named Entity Recognition named-entity-recognition Named Entity Recognition Chinese Named Entity Recognition NER Named Entity Recognition (NER)

Paper PDF Code(official)

Abstract

Neural named entity recognition (NER) models may easily encounter the over-confidence issue, which degrades the performance and calibration. Inspired by label smoothing and driven by the ambiguity of boundary annotation in NER engineering, we propose boundary smoothing as a regularization technique for span-based neural NER models. It re-assigns entity probabilities from annotated spans to the surrounding ones. Built on a simple but strong baseline, our model achieves results better than or competitive with previous state-of-the-art systems on eight well-known NER benchmarks. Further empirical analysis suggests that boundary smoothing effectively mitigates over-confidence, improves model calibration, and brings flatter neural minima and more smoothed loss landscapes.

Results

Task	Dataset	Metric	Value	Model
Named Entity Recognition (NER)	Ontonotes v5 (English)	F1	91.74	Baseline + BS
Named Entity Recognition (NER)	CoNLL 2003 (English)	F1	93.65	Baseline + BS
Named Entity Recognition (NER)	ACE 2005	F1	87.15	Baseline + BS
Named Entity Recognition (NER)	ACE 2004	F1	87.98	Baseline + BS
Named Entity Recognition (NER)	Weibo NER	F1	72.66	Baseline + BS
Named Entity Recognition (NER)	MSRA	F1	96.26	Baseline + BS
Named Entity Recognition (NER)	Resume NER	F1	96.66	Baseline + BS
Named Entity Recognition (NER)	OntoNotes 4	F1	82.83	Baseline + BS

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce2025-07-08 Selecting and Merging: Towards Adaptable and Scalable Named Entity Recognition with Large Language Models2025-06-28 Improving Named Entity Transcription with Contextual LLM-based Revision2025-06-12 Better Semi-supervised Learning for Multi-domain ASR Through Incremental Retraining and Data Filtering2025-06-05 Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective2025-06-05 Efficient Data Selection for Domain Adaptation of ASR Using Pseudo-Labels and Multi-Stage Filtering2025-06-04 EL4NER: Ensemble Learning for Named Entity Recognition via Multiple Small-Parameter Large Language Models2025-05-29 Label-Guided In-Context Learning for Named Entity Recognition2025-05-29