Xinyu Wang, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei Huang, Kewei Tu
Recent advances in Named Entity Recognition (NER) show that document-level contexts can significantly improve model performance. In many application scenarios, however, such contexts are not available. In this paper, we propose to find external contexts of a sentence by retrieving and selecting a set of semantically relevant texts through a search engine, with the original sentence as the query. We find empirically that the contextual representations computed on the retrieval-based input view, constructed through the concatenation of a sentence and its external contexts, can achieve significantly improved performance compared to the original input view based only on the sentence. Furthermore, we can improve the model performance of both input views by Cooperative Learning, a training method that encourages the two input views to produce similar contextual representations or output label distributions. Experiments show that our approach can achieve new state-of-the-art performance on 8 NER data sets across 5 domains.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Named Entity Recognition (NER) | NCBI-disease | F1 | 88.96 | CL-KL |
| Named Entity Recognition (NER) | WNUT 2017 | F1 | 60.45 | CL-KL |
| Named Entity Recognition (NER) | WNUT 2017 | F1 | 59.69 | BERT-CRF (Replicated in AdaSeq) |
| Named Entity Recognition (NER) | CoNLL 2003 (English) | F1 | 93.85 | CL-KL |
| Named Entity Recognition (NER) | CoNLL 2003 (English) | F1 | 93.35 | BERT-CRF (Replicated in AdaSeq) |
| Named Entity Recognition (NER) | CMeEE | F1 | 68.97 | BERT-CRF (Replicated in AdaSeq) |
| Named Entity Recognition (NER) | CoNLL++ | F1 | 94.81 | CL-KL |
| Named Entity Recognition (NER) | WNUT 2016 | F1 | 58.98 | CL-KL |
| Named Entity Recognition (NER) | BC5CDR | F1 | 90.99 | CL-L2 |
| Named Entity Recognition (NER) | Weibo NER | F1 | 72.77 | BERT-CRF (Replicated in AdaSeq) |
| Named Entity Recognition (NER) | MSRA | F1 | 96.69 | BERT-CRF (Replicated in AdaSeq) |
| Named Entity Recognition (NER) | Resume NER | F1 | 96.87 | BERT-CRF (Replicated in AdaSeq) |
| Chunking | CoNLL 2000 | Exact Span F1 | 97.18 | BERT-CRF (Replicated in AdaSeq) |
| Shallow Syntax | CoNLL 2000 | Exact Span F1 | 97.18 | BERT-CRF (Replicated in AdaSeq) |