Giannis Bekoulis, Johannes Deleu, Thomas Demeester, Chris Develder
Adversarial training (AT) is a regularization method that can be used to improve the robustness of neural network methods by adding small perturbations in the training data. We show how to use AT for the tasks of entity recognition and relation extraction. In particular, we demonstrate that applying AT to a general purpose baseline model for jointly extracting entities and relations, allows improving the state-of-the-art effectiveness on several datasets in different contexts (i.e., news, biomedical, and real estate data) and for different languages (English and Dutch).
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Relation Extraction | ACE 2004 | NER Micro F1 | 81.64 | multi-head + AT |
| Relation Extraction | ACE 2004 | RE+ Micro F1 | 47.45 | multi-head + AT |
| Relation Extraction | Adverse Drug Events (ADE) Corpus | NER Macro F1 | 86.73 | multi-head + AT |
| Relation Extraction | Adverse Drug Events (ADE) Corpus | RE+ Macro F1 | 75.52 | multi-head + AT |
| Relation Extraction | CoNLL04 | NER Macro F1 | 83.6 | multi-head + AT |
| Relation Extraction | CoNLL04 | RE+ Macro F1 | 61.95 | multi-head + AT |