CURE
A dataset for Clinical Understanding & Retrieval Evaluation
Texts | Creative Commons Attribution Non Commercial 4.0 | Introduced 2024-12-09
CURE is a retrieval dataset with one monolingual and two cross-lingual conditions, with splits spanning ten medical domains. Queries are natural-language questions formulated by healthcare providers, and they express the information needs of practitioners consulting academic literature in the course of their duties. Queries are available in English, French, and Spanish. The corpus is constructed by mining an index of English passages extracted from biomedical academic articles.
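As an illustration of how the three conditions relate to the query languages, here is a minimal sketch. It assumes that the monolingual condition pairs English queries with the English corpus and that the two cross-lingual conditions pair French and Spanish queries with that same corpus. The language codes, the `Condition` class, and the `conditions` function are hypothetical names chosen for this example, not part of the dataset's actual schema.

```python
from dataclasses import dataclass

# Assumption: ISO 639-1 codes are used here purely for illustration.
CORPUS_LANGUAGE = "en"
QUERY_LANGUAGES = ("en", "fr", "es")


@dataclass(frozen=True)
class Condition:
    """One retrieval condition: a query language paired with the corpus language."""

    query_language: str
    corpus_language: str

    @property
    def kind(self) -> str:
        # A condition is monolingual when queries and passages share a language.
        if self.query_language == self.corpus_language:
            return "monolingual"
        return "cross-lingual"


def conditions() -> list[Condition]:
    """Enumerate every query-language/corpus pairing described above."""
    return [Condition(lang, CORPUS_LANGUAGE) for lang in QUERY_LANGUAGES]


if __name__ == "__main__":
    for cond in conditions():
        print(f"{cond.query_language} -> {cond.corpus_language}: {cond.kind}")
```

Under these assumptions, the enumeration yields one monolingual and two cross-lingual pairings, matching the description.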