CELEX

TextsCustom (research-only)

CELEX database comprises three different searchable lexical databases, Dutch, English and German. The lexical data contained in each database is divided into five categories: orthography, phonology, morphology, syntax (word class) and word frequency.

Source: Polysemy and Brevity versus Frequency in Language Image Source: https://www.aclweb.org/anthology/W17-7619.pdf