Klexikon
Klexikon: A German Dataset for Joint Summarization and Simplification
TextsCC-BY-SAIntroduced 2022-01-18
The dataset introduces document alignments between German Wikipedia and the children's lexicon Klexikon. The source texts in Wikipedia are both written in a more complex language than Klexikon, and also significantly longer, which makes this a suitable application for both summarization and simplification. In fact, previous research has so far only focused on either of the two, but not comprehensively been studied as a joint task.