MedNLI

Medical Natural Language Inference

MedicalTexts

The MedNLI dataset consists of the sentence pairs developed by Physicians from the Past Medical History section of MIMIC-III clinical notes annotated for Definitely True, Maybe True and Definitely False. The dataset contains 11,232 training, 1,395 development and 1,422 test instances. This provides a natural language inference task (NLI) grounded in the medical history of patients.

Source: MT-Clinical BERT: Scaling Clinical Information Extraction with Multitask Learning Image Source: https://arxiv.org/abs/1904.02181