TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Read, Attend, and Code: Pushing the Limits of Medical Code...

Read, Attend, and Code: Pushing the Limits of Medical Codes Prediction from Clinical Notes by Machines

Byung-Hak Kim, Varun Ganapathi

2021-07-10Medical Code PredictionMulti-Label Classification Of Biomedical Texts
PaperPDF

Abstract

Prediction of medical codes from clinical notes is both a practical and essential need for every healthcare delivery organization within current medical systems. Automating annotation will save significant time and excessive effort spent by human coders today. However, the biggest challenge is directly identifying appropriate medical codes out of several thousands of high-dimensional codes from unstructured free-text clinical notes. In the past three years, with Convolutional Neural Networks (CNN) and Long Short-Term Memory (LTSM) networks, there have been vast improvements in tackling the most challenging benchmark of the MIMIC-III-full-label inpatient clinical notes dataset. This progress raises the fundamental question of how far automated machine learning (ML) systems are from human coders' working performance. We assessed the baseline of human coders' performance on the same subsampled testing set. We also present our Read, Attend, and Code (RAC) model for learning the medical code assignment mappings. By connecting convolved embeddings with self-attention and code-title guided attention modules, combined with sentence permutation-based data augmentations and stochastic weight averaging training, RAC establishes a new state of the art (SOTA), considerably outperforming the current best Macro-F1 by 18.7%, and reaches past the human-level coding baseline. This new milestone marks a meaningful step toward fully autonomous medical coding (AMC) in machines reaching parity with human coders' performance in medical code prediction.

Results

TaskDatasetMetricValueModel
Medical Code PredictionMIMIC-IIIMacro-AUC94.8RAC
Medical Code PredictionMIMIC-IIIMacro-F112.7RAC
Medical Code PredictionMIMIC-IIIMicro-AUC99.2RAC
Medical Code PredictionMIMIC-IIIMicro-F158.6RAC
Medical Code PredictionMIMIC-IIIPrecision@1560.1RAC
Medical Code PredictionMIMIC-IIIPrecision@582.9RAC
Medical Code PredictionMIMIC-IIIPrecision@875.4RAC
Multi-Label ClassificationMIMIC-IIIMacro-AUC94.8RAC
Multi-Label ClassificationMIMIC-IIIMacro-F112.7RAC
Multi-Label ClassificationMIMIC-IIIMicro-AUC99.2RAC
Multi-Label ClassificationMIMIC-IIIMicro-F158.6RAC
Multi-Label ClassificationMIMIC-IIIPrecision@1560.1RAC
Multi-Label ClassificationMIMIC-IIIPrecision@582.9RAC
Multi-Label ClassificationMIMIC-IIIPrecision@875.4RAC

Related Papers

Uncertainty-aware abstention in medical diagnosis based on medical texts2025-02-25An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records2024-06-13Effective Medical Code Prediction via Label Internal Alignment2023-05-09Automated Medical Coding on MIMIC-III and MIMIC-IV: A Critical Review and Replicability Study2023-04-21Can Current Explainability Help Provide References in Clinical Notes to Support Humans Annotate Medical Codes?2022-10-28Knowledge Injected Prompt Based Fine-tuning for Multi-label Few-shot ICD Coding2022-10-07Automatic ICD Coding Exploiting Discourse Structure and Reconciled Code Embeddings2022-10-01HiCu: Leveraging Hierarchy for Curriculum Learning in Automated ICD Coding2022-08-03