Kor-Native

Native Korean Corpus

TextsIntroduced 2022-10-25

Kor-Learner is a Korean grammatical error correction (GEC) dataset collected grammatically from two sources, and the correct sentences were read using Google Text-to-Speech(TTS) system. The general public was tasked with dictating grammatically correct sentences and transcribe them. It contains more than 17K sentence pairs.

Source: Towards standardizing Korean Grammatical Error Correction: Datasets and Annotation

Image Source: https://arxiv.org/pdf/2210.14389v2.pdf