Kor-Learner

Korean Learner Corpus

TextsIntroduced 2022-10-25

Kor-Learner is a Korean grammatical error correction (GEC) dataset made from the NIKL learner corpus containing essays written by Korean learners and their grammatical error correction annotations by their tutors in an morpheme-level XML file format. It contains more than 28K sentence pairs.

Source: Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

Image Source: https://arxiv.org/pdf/2210.14389v2.pdf