Enwei Zhu, Jinpeng Li
Neural named entity recognition (NER) models may easily encounter the over-confidence issue, which degrades the performance and calibration. Inspired by label smoothing and driven by the ambiguity of boundary annotation in NER engineering, we propose boundary smoothing as a regularization technique for span-based neural NER models. It re-assigns entity probabilities from annotated spans to the surrounding ones. Built on a simple but strong baseline, our model achieves results better than or competitive with previous state-of-the-art systems on eight well-known NER benchmarks. Further empirical analysis suggests that boundary smoothing effectively mitigates over-confidence, improves model calibration, and brings flatter neural minima and more smoothed loss landscapes.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Named Entity Recognition (NER) | Ontonotes v5 (English) | F1 | 91.74 | Baseline + BS |
| Named Entity Recognition (NER) | CoNLL 2003 (English) | F1 | 93.65 | Baseline + BS |
| Named Entity Recognition (NER) | ACE 2005 | F1 | 87.15 | Baseline + BS |
| Named Entity Recognition (NER) | ACE 2004 | F1 | 87.98 | Baseline + BS |
| Named Entity Recognition (NER) | Weibo NER | F1 | 72.66 | Baseline + BS |
| Named Entity Recognition (NER) | MSRA | F1 | 96.26 | Baseline + BS |
| Named Entity Recognition (NER) | Resume NER | F1 | 96.66 | Baseline + BS |
| Named Entity Recognition (NER) | OntoNotes 4 | F1 | 82.83 | Baseline + BS |