CELLS

TextsMIT licenseIntroduced 2022-11-07

CELLS is a large (63k pairs) and broadest-ranging (12 journals) parallel corpus for lay language generation. The abstract and the corresponding lay language summary are written by domain experts, assuring the quality of the dataset.

Source: CELLS: A Parallel Corpus for Biomedical Lay Language Generation

Image Source: https://arxiv.org/pdf/2211.03818v1.pdf