TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Text Simplification by Tagging

Text Simplification by Tagging

Kostiantyn Omelianchuk, Vipul Raheja, Oleksandr Skurzhanskyi

2021-03-08EACL (BEA) 2021 4Text Simplification
PaperPDFCode(official)

Abstract

Edit-based approaches have recently shown promising results on multiple monolingual sequence transduction tasks. In contrast to conventional sequence-to-sequence (Seq2Seq) models, which learn to generate text from scratch as they are trained on parallel corpora, these methods have proven to be much more effective since they are able to learn to make fast and accurate transformations while leveraging powerful pre-trained language models. Inspired by these ideas, we present TST, a simple and efficient Text Simplification system based on sequence Tagging, leveraging pre-trained Transformer-based encoders. Our system makes simplistic data augmentations and tweaks in training and inference on a pre-existing system, which makes it less reliant on large amounts of parallel training data, provides more control over the outputs and enables faster inference speeds. Our best model achieves near state-of-the-art performance on benchmark test datasets for the task. Since it is fully non-autoregressive, it achieves faster inference speeds by over 11 times than the current state-of-the-art text simplification system.

Results

TaskDatasetMetricValueModel
Text SimplificationTurkCorpusSARI (EASSE>=0.2.1)41.46TST
Text SimplificationASSETSARI (EASSE>=0.2.1)43.21TST
Text SimplificationPWKP / WikiSmallSARI44.67TST
Text SimplificationPWKP / WikiSmallSARI (EASSE>=0.2.1)44.67TST

Related Papers

Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport2025-05-29Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data2025-05-28Automated Feedback Loops to Protect Text Simplification with Generative AI from Information Loss2025-05-22Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection2025-05-22A Structured Literature Review on Traditional Approaches in Current Natural Language Processing2025-05-19LLM-based Text Simplification and its Effect on User Comprehension and Cognitive Load2025-05-04SimplifyMyText: An LLM-Based System for Inclusive Plain Language Text Simplification2025-04-19Evaluation Under Imperfect Benchmarks and Ratings: A Case Study in Text Simplification2025-04-13