WikiBio
Wikipedia Biography Dataset
TextsCC BY-SA 3.0Introduced 2016-01-01
This dataset gathers 728,321 biographies from English Wikipedia. It aims at evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).