TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Medical Code Prediction from Discharge Summary: Document t...

Medical Code Prediction from Discharge Summary: Document to Sequence BERT using Sequence Attention

Tak-Sung Heo, Yongmin Yoo, Yeongjoon Park, Byeong-Cheol Jo, Kyungsun Kim

2021-06-15Medical DiagnosisMedical Code PredictionMulti-Label Text Classification
PaperPDFCode(official)

Abstract

Clinical notes are unstructured text generated by clinicians during patient encounters. Clinical notes are usually accompanied by a set of metadata codes from the International Classification of Diseases(ICD). ICD code is an important code used in various operations, including insurance, reimbursement, medical diagnosis, etc. Therefore, it is important to classify ICD codes quickly and accurately. However, annotating these codes is costly and time-consuming. So we propose a model based on bidirectional encoder representations from transformers (BERT) using the sequence attention method for automatic ICD code assignment. We evaluate our approach on the medical information mart for intensive care III (MIMIC-III) benchmark dataset. Our model achieved performance of macro-averaged F1: 0.62898 and micro-averaged F1: 0.68555 and is performing better than a performance of the state-of-the-art model using the MIMIC-III dataset. The contribution of this study proposes a method of using BERT that can be applied to documents and a sequence attention method that can capture important sequence in-formation appearing in documents.

Results

TaskDatasetMetricValueModel
Multi-Label Text ClassificationMIMIC-III-50Micro-F168.555D2SBERT using Sequence Attention
Text ClassificationMIMIC-III-50Micro-F168.555D2SBERT using Sequence Attention
ClassificationMIMIC-III-50Micro-F168.555D2SBERT using Sequence Attention

Related Papers

Hear Your Code Fail, Voice-Assisted Debugging for Python2025-07-20AI-Driven Cytomorphology Image Synthesis for Medical Diagnostics2025-07-07AI-Driven MRI-based Brain Tumour Segmentation Benchmarking2025-06-25A Framework for Uncertainty Quantification Based on Nearest Neighbors Across Layers2025-06-24MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration2025-06-24A Real-time Endoscopic Image Denoising System2025-06-18Dr. GPT Will See You Now, but Should It? Exploring the Benefits and Harms of Large Language Models in Medical Diagnosis using Crowdsourced Clinical Cases2025-06-13Taming Stable Diffusion for Computed Tomography Blind Super-Resolution2025-06-13