TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Structural Entities Extraction and Patient Indications Inc...

Structural Entities Extraction and Patient Indications Incorporation for Chest X-ray Report Generation

Kang Liu, Zhuoqi Ma, Xiaolu Kang, Zhusi Zhong, Zhicheng Jiao, Grayson Baird, Harrison Bai, Qiguang Miao

2024-05-23Medical Report GenerationText Generationcross-modal alignmentDiagnostic
PaperPDFCode(official)

Abstract

The automated generation of imaging reports proves invaluable in alleviating the workload of radiologists. A clinically applicable reports generation algorithm should demonstrate its effectiveness in producing reports that accurately describe radiology findings and attend to patient-specific indications. In this paper, we introduce a novel method, \textbf{S}tructural \textbf{E}ntities extraction and patient indications \textbf{I}ncorporation (SEI) for chest X-ray report generation. Specifically, we employ a structural entities extraction (SEE) approach to eliminate presentation-style vocabulary in reports and improve the quality of factual entity sequences. This reduces the noise in the following cross-modal alignment module by aligning X-ray images with factual entity sequences in reports, thereby enhancing the precision of cross-modal alignment and further aiding the model in gradient-free retrieval of similar historical cases. Subsequently, we propose a cross-modal fusion network to integrate information from X-ray images, similar historical cases, and patient-specific indications. This process allows the text decoder to attend to discriminative features of X-ray images, assimilate historical diagnostic information from similar cases, and understand the examination intention of patients. This, in turn, assists in triggering the text decoder to produce high-quality reports. Experiments conducted on MIMIC-CXR validate the superiority of SEI over state-of-the-art approaches on both natural language generation and clinical efficacy metrics.

Results

TaskDatasetMetricValueModel
Medical Report GenerationMIMIC-CXRBLEU-20.247SEI-1
Medical Report GenerationMIMIC-CXRBLEU-40.135SEI-1
Medical Report GenerationMIMIC-CXRExample-F1-140.46SEI-1
Medical Report GenerationMIMIC-CXRF1 RadGraph0.249SEI-1
Medical Report GenerationMIMIC-CXRMETEOR0.158SEI-1
Medical Report GenerationMIMIC-CXRMicro-F1-50.542SEI-1
Medical Report GenerationMIMIC-CXRROUGE-L0.299SEI-1

Related Papers

Smart fault detection in satellite electrical power system2025-07-18Making Language Model a Hierarchical Classifier and Generator2025-07-17Transformer-based Spatial Grounding: A Comprehensive Survey2025-07-17Demographic-aware fine-grained classification of pediatric wrist fractures2025-07-17Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16Trustworthy Tree-based Machine Learning by $MoS_2$ Flash-based Analog CAM with Inherent Soft Boundaries2025-07-16The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15