Kor-Lang8

Lang-8 Korean Corpus

Texts · Introduced 2022-10-25

Kor-Lang8 is a Korean grammatical error correction (GEC) dataset extracted from the NAIST Lang-8 Learner Corpora by filtering on the language label. It contains more than 109K sentence pairs.
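Filtering by language label can be illustrated with a minimal sketch. The record layout here is an assumption made for illustration only: the field names `learning_language`, `sentences`, and `corrections`, the helper `extract_pairs`, and the sample sentences are all hypothetical, and none of them reflects the actual Lang-8 file format or the authors' extraction pipeline.

```python
from typing import Iterable, Iterator, List, Tuple


def extract_pairs(
    records: Iterable[dict], language: str = "Korean"
) -> Iterator[Tuple[str, str]]:
    """Yield (learner_sentence, corrected_sentence) pairs for one language label.

    Assumes each record is a dict with hypothetical keys:
      - "learning_language": the language label of the entry
      - "sentences": list of learner-written sentences
      - "corrections": list (parallel to "sentences") of lists of corrections
    """
    for rec in records:
        # Keep only entries whose language label matches the target language.
        if rec.get("learning_language") != language:
            continue
        for source, corrections in zip(rec["sentences"], rec["corrections"]):
            for target in corrections:
                # Skip empty corrections; a sentence may have zero or many.
                if target:
                    yield source, target


# Toy records, invented for this example.
records: List[dict] = [
    {
        "learning_language": "Korean",
        "sentences": ["저는 학교에 갔어요", "날씨가 좋아요"],
        "corrections": [["저는 학교에 갔습니다"], []],
    },
    {
        "learning_language": "English",
        "sentences": ["I goes home"],
        "corrections": [["I go home"]],
    },
]

pairs = list(extract_pairs(records))
```

In this toy input only the first Korean sentence has a correction, so `pairs` holds a single sentence pair; the English entry is dropped by the label filter.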

Source: Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

Image Source: https://arxiv.org/pdf/2210.14389v2.pdf