Stronger Baselines for Grammatical Error Correction Using Pretrained Encoder-Decoder Model

Satoru Katsumata, Mamoru Komachi

2020-05-24Grammatical Error Correction

Abstract

Studies on grammatical error correction (GEC) have reported the effectiveness of pretraining a Seq2Seq model with a large amount of pseudodata. However, this approach requires time-consuming pretraining for GEC because of the size of the pseudodata. In this study, we explore the utility of bidirectional and auto-regressive transformers (BART) as a generic pretrained encoder-decoder model for GEC. With the use of this generic pretrained model for GEC, the time-consuming pretraining can be eliminated. We find that monolingual and multilingual BART models achieve high performance in GEC, with one of the results being comparable to the current strong results in English GEC. Our implementations are publicly available at GitHub (https://github.com/Katsumata420/generic-pretrained-GEC).

Results

Task	Dataset	Metric	Value	Model
Grammatical Error Correction	CoNLL-2014 Shared Task	F0.5	63	BART
Grammatical Error Correction	CoNLL-2014 Shared Task	Precision	69.9	BART
Grammatical Error Correction	CoNLL-2014 Shared Task	Recall	45.1	BART

Related Papers

End-to-End Spoken Grammatical Error Correction2025-06-23 IMPARA-GED: Grammatical Error Detection is Boosting Reference-free Grammatical Error Quality Estimator2025-06-03 Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction2025-05-27 gec-metrics: A Unified Library for Grammatical Error Correction Evaluation2025-05-26 Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models2025-05-09 Enriching the Korean Learner Corpus with Multi-reference Annotations and Rubric-Based Scoring2025-05-01 Deep Learning Model Deployment in Multiple Cloud Providers: an Exploratory Study Using Low Computing Power Environments2025-03-31 Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study2025-03-02