TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/GECToR -- Grammatical Error Correction: Tag, Not Rewrite

GECToR -- Grammatical Error Correction: Tag, Not Rewrite

Kostiantyn Omelianchuk, Vitaliy Atrasevych, Artem Chernodub, Oleksandr Skurzhanskyi

2020-05-26WS 2020 7TAGGrammatical Error Correction
PaperPDFCodeCode(official)Code

Abstract

In this paper, we present a simple and efficient GEC sequence tagger using a Transformer encoder. Our system is pre-trained on synthetic data and then fine-tuned in two stages: first on errorful corpora, and second on a combination of errorful and error-free parallel corpora. We design custom token-level transformations to map input tokens to target corrections. Our best single-model/ensemble GEC tagger achieves an $F_{0.5}$ of 65.3/66.5 on CoNLL-2014 (test) and $F_{0.5}$ of 72.4/73.6 on BEA-2019 (test). Its inference speed is up to 10 times as fast as a Transformer-based seq2seq GEC system. The code and trained models are publicly available.

Results

TaskDatasetMetricValueModel
Grammatical Error CorrectionCoNLL-2014 Shared TaskF0.566.5Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)
Grammatical Error CorrectionCoNLL-2014 Shared TaskPrecision78.2Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)
Grammatical Error CorrectionCoNLL-2014 Shared TaskRecall41.5Sequence tagging + token-level transformations + two-stage fine-tuning (+BERT, RoBERTa, XLNet)
Grammatical Error CorrectionCoNLL-2014 Shared TaskF0.565.3Sequence tagging + token-level transformations + two-stage fine-tuning (+XLNet)
Grammatical Error CorrectionCoNLL-2014 Shared TaskPrecision77.5Sequence tagging + token-level transformations + two-stage fine-tuning (+XLNet)
Grammatical Error CorrectionCoNLL-2014 Shared TaskRecall40.1Sequence tagging + token-level transformations + two-stage fine-tuning (+XLNet)
Grammatical Error CorrectionBEA-2019 (test)F0.573.7Sequence tagging + token-level transformations + two-stage fine-tuning (+RoBERTa, XLNet)
Grammatical Error CorrectionBEA-2019 (test)F0.572.4Sequence tagging + token-level transformations + two-stage fine-tuning (+XLNet)

Related Papers

CogniSQL-R1-Zero: Lightweight Reinforced Reasoning for Efficient SQL Generation2025-07-08End-to-End Spoken Grammatical Error Correction2025-06-23LLMs in Coding and their Impact on the Commercial Software Engineering Landscape2025-06-19How to Speak to a Real Person at Singapore Airlines®: 15 Easy Methods Explained2025-06-17Call To Speak To Someone At Frontier™️ Airlines Through Various Contact Options: The Ultimate Step Guide2025-06-17Call To Speak To Someone At Expedia Through Various Contact Options: The Ultimate Step Guide2025-06-1723 Ways to Contact How Do I Talk to Someone at Expedia®: A Step-by-Step Guide2025-06-17LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops2025-06-17