TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CheXGenBench: A Unified Benchmark For Fidelity, Privacy an...

CheXGenBench: A Unified Benchmark For Fidelity, Privacy and Utility of Synthetic Chest Radiographs

Raman Dutt, Pedro Sanchez, Yongchen Yao, Steven McDonagh, Sotirios A. Tsaftaris, Timothy Hospedales

2025-05-15Conditional Text-to-Image Synthesis
PaperPDFCode(official)

Abstract

We introduce CheXGenBench, a rigorous and multifaceted evaluation framework for synthetic chest radiograph generation that simultaneously assesses fidelity, privacy risks, and clinical utility across state-of-the-art text-to-image generative models. Despite rapid advancements in generative AI for real-world imagery, medical domain evaluations have been hindered by methodological inconsistencies, outdated architectural comparisons, and disconnected assessment criteria that rarely address the practical clinical value of synthetic samples. CheXGenBench overcomes these limitations through standardised data partitioning and a unified evaluation protocol comprising over 20 quantitative metrics that systematically analyse generation quality, potential privacy vulnerabilities, and downstream clinical applicability across 11 leading text-to-image architectures. Our results reveal critical inefficiencies in the existing evaluation protocols, particularly in assessing generative fidelity, leading to inconsistent and uninformative comparisons. Our framework establishes a standardised benchmark for the medical AI community, enabling objective and reproducible comparisons while facilitating seamless integration of both existing and future generative models. Additionally, we release a high-quality, synthetic dataset, SynthCheX-75K, comprising 75K radiographs generated by the top-performing model (Sana 0.6B) in our benchmark to support further research in this critical domain. Through CheXGenBench, we establish a new state-of-the-art and release our framework, models, and SynthCheX-75K dataset at https://raman1121.github.io/CheXGenBench/

Results

TaskDatasetMetricValueModel
Image GenerationMIMIC-CXRFID (RadDino)54.22Sana
Image GenerationMIMIC-CXRFID (RadDino)60.15Pixart Sigma
Image GenerationMIMIC-CXRFID (RadDino)69.69RadEdit
Image GenerationMIMIC-CXRFID (RadDino)71.24LLM-CXR
Image GenerationMIMIC-CXRFID (RadDino)74.58SD V3.5 Medium (LoRA r128)
Image GenerationMIMIC-CXRFID (RadDino)88.28Lumina 2.0 (LoRA r128)
Image GenerationMIMIC-CXRFID (RadDino)93.1SD V3.5 Medium (LoRA r32)
Image GenerationMIMIC-CXRFID (RadDino)101.19Lumina 2.0 (LoRA r32)
Image GenerationMIMIC-CXRFID (RadDino)118.93SD V1-5
Image GenerationMIMIC-CXRFID (RadDino)122.4Flux.1-Dev (LoRA r32)
Image GenerationMIMIC-CXRFID (RadDino)125.18SD V1-4
Image GenerationMIMIC-CXRFID (RadDino)186.53SD V2-1
Image GenerationMIMIC-CXRFID (RadDino)194.72SD V2
Text-to-Image GenerationMIMIC-CXRFID (RadDino)54.22Sana
Text-to-Image GenerationMIMIC-CXRFID (RadDino)60.15Pixart Sigma
Text-to-Image GenerationMIMIC-CXRFID (RadDino)69.69RadEdit
Text-to-Image GenerationMIMIC-CXRFID (RadDino)71.24LLM-CXR
Text-to-Image GenerationMIMIC-CXRFID (RadDino)74.58SD V3.5 Medium (LoRA r128)
Text-to-Image GenerationMIMIC-CXRFID (RadDino)88.28Lumina 2.0 (LoRA r128)
Text-to-Image GenerationMIMIC-CXRFID (RadDino)93.1SD V3.5 Medium (LoRA r32)
Text-to-Image GenerationMIMIC-CXRFID (RadDino)101.19Lumina 2.0 (LoRA r32)
Text-to-Image GenerationMIMIC-CXRFID (RadDino)118.93SD V1-5
Text-to-Image GenerationMIMIC-CXRFID (RadDino)122.4Flux.1-Dev (LoRA r32)
Text-to-Image GenerationMIMIC-CXRFID (RadDino)125.18SD V1-4
Text-to-Image GenerationMIMIC-CXRFID (RadDino)186.53SD V2-1
Text-to-Image GenerationMIMIC-CXRFID (RadDino)194.72SD V2
10-shot image generationMIMIC-CXRFID (RadDino)54.22Sana
10-shot image generationMIMIC-CXRFID (RadDino)60.15Pixart Sigma
10-shot image generationMIMIC-CXRFID (RadDino)69.69RadEdit
10-shot image generationMIMIC-CXRFID (RadDino)71.24LLM-CXR
10-shot image generationMIMIC-CXRFID (RadDino)74.58SD V3.5 Medium (LoRA r128)
10-shot image generationMIMIC-CXRFID (RadDino)88.28Lumina 2.0 (LoRA r128)
10-shot image generationMIMIC-CXRFID (RadDino)93.1SD V3.5 Medium (LoRA r32)
10-shot image generationMIMIC-CXRFID (RadDino)101.19Lumina 2.0 (LoRA r32)
10-shot image generationMIMIC-CXRFID (RadDino)118.93SD V1-5
10-shot image generationMIMIC-CXRFID (RadDino)122.4Flux.1-Dev (LoRA r32)
10-shot image generationMIMIC-CXRFID (RadDino)125.18SD V1-4
10-shot image generationMIMIC-CXRFID (RadDino)186.53SD V2-1
10-shot image generationMIMIC-CXRFID (RadDino)194.72SD V2
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)54.22Sana
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)60.15Pixart Sigma
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)69.69RadEdit
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)71.24LLM-CXR
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)74.58SD V3.5 Medium (LoRA r128)
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)88.28Lumina 2.0 (LoRA r128)
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)93.1SD V3.5 Medium (LoRA r32)
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)101.19Lumina 2.0 (LoRA r32)
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)118.93SD V1-5
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)122.4Flux.1-Dev (LoRA r32)
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)125.18SD V1-4
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)186.53SD V2-1
1 Image, 2*2 StitchiMIMIC-CXRFID (RadDino)194.72SD V2

Related Papers

Test-time Conditional Text-to-Image Synthesis Using Diffusion Models2024-11-16Zero-Painter: Training-Free Layout Control for Text-to-Image Synthesis2024-06-06MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis2024-02-08InstanceDiffusion: Instance-level Control for Image Generation2024-02-05BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion2023-07-20Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion Models2023-05-25LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis2023-05-19GLIGEN: Open-Set Grounded Text-to-Image Generation2023-01-17