Andrea Galassi, Marco Lippi, Paolo Torroni
We explore the use of residual networks and neural attention for multiple argument mining tasks. We propose a residual architecture that exploits attention, multi-task learning, and makes use of ensemble, without any assumption on document or argument structure. We present an extensive experimental evaluation on five different corpora of user-generated comments, scientific publications, and persuasive essays. Our results show that our approach is a strong competitor against state-of-the-art architectures with a higher computational footprint or corpus-specific design, representing an interesting compromise between generality, performance accuracy and reduced model size.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Link Prediction | DRI Corpus | F1 | 43.66 | ResAttArg |
| Link Prediction | CDCP | F1 | 29.73 | ResAttArg |
| Link Prediction | AbstRCT - Neoplasm | F1 | 54.43 | ResAttArg |
| Relation Extraction | AbstRCT - Neoplasm | Macro F1 | 70.92 | ResAttArg |
| Relation Extraction | CDCP | Macro F1 | 42.95 | ResAttArg |
| Relation Extraction | DRI Corpus | Macro F1 | 37.72 | ResAttArg |
| Data Mining | CDCP | Macro F1 | 78.71 | ResAttArg |
| Relation Classification | AbstRCT - Neoplasm | Macro F1 | 70.92 | ResAttArg |
| Relation Classification | CDCP | Macro F1 | 42.95 | ResAttArg |
| Relation Classification | DRI Corpus | Macro F1 | 37.72 | ResAttArg |
| Interpretable Machine Learning | CDCP | Macro F1 | 78.71 | ResAttArg |
| Argument Mining | CDCP | Macro F1 | 78.71 | ResAttArg |