GUE

Genome Understanding Evaluation

MedicalTextsIntroduced 2023-06-26

A collection of 2828 datasets across 77 tasks constructed for genome language model evaluation. Contains seven tasks: promoter prediction. core promoter prediction, splice site prediction, covid variant classification, epigenetic marks prediction, and transcription factor binding sites prediction on human and mouse.