WikiBio

Wikipedia Biography Dataset

TextsCC BY-SA 3.0Introduced 2016-01-01

This dataset gathers 728,321 biographies from English Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).