Marek Rei, Helen Yannakoudakis
We investigate the utility of different auxiliary objectives and training strategies within a neural sequence labeling approach to error detection in learner writing. Auxiliary costs provide the model with additional linguistic information, allowing it to learn general-purpose compositional features that can then be exploited for other objectives. Our experiments show that a joint learning approach trained with parallel labels on in-domain data improves performance over the previous best error detection system. While the resulting model has the same number of parameters, the additional objectives allow it to be optimised more efficiently and achieve better performance.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Grammatical Error Correction | CoNLL-2014 A1 | F0.5 | 36.1 | Bi-LSTM + POS (unrestricted data) |
| Grammatical Error Correction | CoNLL-2014 A1 | F0.5 | 17.5 | Bi-LSTM + POS (trained on FCE) |
| Grammatical Error Correction | CoNLL-2014 A2 | F0.5 | 45.1 | Bi-LSTM + POS (unrestricted data) |
| Grammatical Error Correction | CoNLL-2014 A2 | F0.5 | 26.2 | Bi-LSTM + POS (trained on FCE) |
| Grammatical Error Correction | FCE | F0.5 | 47.7 | Bi-LSTM + err POS GR |