Illness-dataset

Illness multi-domain textual dataset

TextsMITIntroduced 2021-12-05

A dataset for evaluating text classification, domain adaptation, and active learning models. The dataset consists of 22,660 documents (tweets) collected in 2018 and 2019. It spans across four domains: Alzheimer's, Parkinson's, Cancer, and Diabetes.