Lenta Short Sentences

Texts

The Lenta Short Sentences dataset is a Russian-language text dataset for language modelling. It consists of 236K sentences sampled from the Lenta News dataset.

Source: https://arxiv.org/pdf/2005.02470.pdf