Betty van Aken, Jens-Michalis Papaioannou, Manuel Mayrdorfer, Klemens Budde, Felix A. Gers, Alexander Löser
Outcome prediction from clinical text can prevent doctors from overlooking possible risks and help hospitals to plan capacities. We simulate patients at admission time, when decision support can be especially valuable, and contribute a novel admission to discharge task with four common outcome prediction targets: Diagnoses at discharge, procedures performed, in-hospital mortality and length-of-stay prediction. The ideal system should infer outcomes based on symptoms, pre-conditions and risk factors of a patient. We evaluate the effectiveness of language models to handle this scenario and propose clinical outcome pre-training to integrate knowledge about patient outcomes from multiple public sources. We further present a simple method to incorporate ICD code hierarchy into the models. We show that our approach improves performance on the outcome tasks against several baselines. A detailed analysis reveals further strengths of the model, including transferability, but also weaknesses such as handling of vital values and inconsistencies in the underlying data.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Length-of-Stay prediction | Clinical Admission Notes from MIMIC-III | AUROC | 72.53 | CORe |
| Length-of-Stay prediction | Clinical Admission Notes from MIMIC-III | AUROC | 71.59 | BioBERT Base |
| Length-of-Stay prediction | Clinical Admission Notes from MIMIC-III | AUROC | 70.4 | BERT Base |
| Medical Diagnosis | Clinical Admission Notes from MIMIC-III | AUROC | 83.54 | CORe |
| Medical Diagnosis | Clinical Admission Notes from MIMIC-III | AUROC | 82.81 | BioBERT Base |
| Electrocardiography (ECG) | Clinical Admission Notes from MIMIC-III | AUROC | 84.04 | CORe |
| Electrocardiography (ECG) | Clinical Admission Notes from MIMIC-III | AUROC | 82.55 | BioBERT Base |
| Electrocardiography (ECG) | Clinical Admission Notes from MIMIC-III | AUROC | 81.13 | BERT Base |
| Mortality Prediction | Clinical Admission Notes from MIMIC-III | AUROC | 84.04 | CORe |
| Mortality Prediction | Clinical Admission Notes from MIMIC-III | AUROC | 82.55 | BioBERT Base |
| Mortality Prediction | Clinical Admission Notes from MIMIC-III | AUROC | 81.13 | BERT Base |
| Medical Procedure | Clinical Admission Notes from MIMIC-III | AUROC | 88.37 | CORe |
| Medical Procedure | Clinical Admission Notes from MIMIC-III | AUROC | 86.36 | BioBERT Base |
| Medical Procedure | Clinical Admission Notes from MIMIC-III | AUROC | 85.84 | BERT Base |
| Medical waveform analysis | Clinical Admission Notes from MIMIC-III | AUROC | 84.04 | CORe |
| Medical waveform analysis | Clinical Admission Notes from MIMIC-III | AUROC | 82.55 | BioBERT Base |
| Medical waveform analysis | Clinical Admission Notes from MIMIC-III | AUROC | 81.13 | BERT Base |