Congying Xia, Chenwei Zhang, Tao Yang, Yaliang Li, Nan Du, Xian Wu, Wei Fan, Fenglong Ma, Philip Yu
This paper presents a novel framework, MGNER, for Multi-Grained Named Entity Recognition where multiple entities or entity mentions in a sentence could be non-overlapping or totally nested. Different from traditional approaches regarding NER as a sequential labeling task and annotate entities consecutively, MGNER detects and recognizes entities on multiple granularities: it is able to recognize named entities without explicitly assuming non-overlapping or totally nested structures. MGNER consists of a Detector that examines all possible word segments and a Classifier that categorizes entities. In addition, contextual information and a self-attention mechanism are utilized throughout the framework to improve the NER performance. Experimental results show that MGNER outperforms current state-of-the-art baselines up to 4.4% in terms of the F1 score among nested/non-overlapping NER tasks.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Named Entity Recognition (NER) | ACE 2004 | F1 | 79.5 | MGNER |
| Named Entity Recognition (NER) | ACE 2005 | F1 | 78.2 | MGNER |
| Named Entity Recognition (NER) | CoNLL 2003 (English) | F1 | 92.28 | MGNER |
| Named Entity Recognition (NER) | ACE 2005 | F1 | 78.2 | MGNER |
| Named Entity Recognition (NER) | ACE 2004 | F1 | 79.5 | MGNER |
| Nested Mention Recognition | ACE 2005 | F1 | 78.2 | MGNER |
| Nested Mention Recognition | ACE 2004 | F1 | 79.5 | MGNER |