Dhruv Kumar, Lili Mou, Lukasz Golab, Olga Vechtomova
We present a novel iterative, edit-based approach to unsupervised sentence simplification. Our model is guided by a scoring function involving fluency, simplicity, and meaning preservation. Then, we iteratively perform word and phrase-level edits on the complex sentence. Compared with previous approaches, our model does not require a parallel training set, but is more controllable and interpretable. Experiments on Newsela and WikiLarge datasets show that our approach is nearly as effective as state-of-the-art supervised approaches.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Simplification | TurkCorpus | BLEU | 73.62 | Edit-Unsup-TS |
| Text Simplification | TurkCorpus | SARI (EASSE>=0.2.1) | 37.85 | Edit-Unsup-TS |
| Text Simplification | Newsela | BLEU | 17.36 | Edit-Unsup-TS |
| Text Simplification | Newsela | SARI | 30.44 | Edit-Unsup-TS |