Kor-Learner
Korean Learner Corpus
Texts
Introduced 2022-10-25
Kor-Learner is a Korean grammatical error correction (GEC) dataset built from the NIKL learner corpus. It consists of essays written by learners of Korean, together with the grammatical error corrections annotated by their tutors in a morpheme-level XML file format. It contains more than 28K sentence pairs.
Source: Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation
Image Source: https://arxiv.org/pdf/2210.14389v2.pdf