TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/MIMIC-IV ICD-10

MIMIC-IV ICD-10

TextsPhysioNet Credentialed Health Data License 1.5.0Introduced 2023-04-21

MIMIC-IV ICD-10 contains 122,279 discharge summaries—free-text medical documents—annotated with ICD-10 diagnosis and procedure codes. It contains data for patients admitted to the Beth Israel Deaconess Medical Center emergency department or ICU between 2008-2019. All codes with fewer than ten examples have been removed, and the train-val-test split was created using [multi-label stratified sampling](MIMIC-IV ICD-9). The dataset is described further in Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study, and the code to use the dataset is found here.

The dataset is intended for medical code prediction and was created using MIMIC-IV v2.2 and MIMIC-IV-NOTE v2.2. Using the two datasets requires a license obtained in Physionet; this can take a couple of days.

Benchmarks

Medical Code Prediction/Precision@8Medical Code Prediction/F1 MacroMedical Code Prediction/F1 MicroMedical Code Prediction/Precision@15Medical Code Prediction/R-PrecMedical Code Prediction/mAPMedical Code Prediction/Exact Match RatioMedical Code Prediction/AUC MacroMedical Code Prediction/AUC MicroMulti-Label Classification/Precision@8Multi-Label Classification/F1 MacroMulti-Label Classification/F1 MicroMulti-Label Classification/Precision@15Multi-Label Classification/R-PrecMulti-Label Classification/mAPMulti-Label Classification/Exact Match RatioMulti-Label Classification/AUC MacroMulti-Label Classification/AUC Micro

Statistics

Papers
3
Benchmarks
18

Links

Homepage

Tasks

Medical Code PredictionMulti-Label Classification