
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification

Daniel Strick, Carlos Garcia, Anthony Huang

2025-05-10 | Multi-Label Classification

Abstract

Deep learning for radiologic image analysis is a rapidly growing field in biomedical research and is likely to become standard practice in modern medicine. Using the publicly available NIH ChestX-ray14 dataset, whose X-ray images are labeled by the presence or absence of 14 different diseases, we reproduced the CheXNet algorithm and explored other algorithms that outperform CheXNet's baseline metrics. Model performance was evaluated primarily with the F1 score and AUC-ROC, both of which are critical metrics for imbalanced, multi-label classification tasks in medical imaging. The best model achieved an average AUC-ROC of 0.85 and an average F1 score of 0.39 across all 14 disease classes in the dataset.
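
The two metrics named in the abstract can be illustrated with a minimal sketch of per-class AUC-ROC and F1 averaged over classes (macro averaging). This is not the authors' evaluation code: the function names, the toy labels and scores, and the 0.5 decision threshold are all assumptions made for illustration, and the paper may choose thresholds differently.

```python
from typing import List, Sequence, Tuple


def auc_roc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC-ROC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC-ROC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def f1_score(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Binary F1 = 2*TP / (2*TP + FP + FN); returns 0.0 if undefined."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def macro_metrics(
    labels: List[List[int]],
    scores: List[List[float]],
    threshold: float = 0.5,  # assumed threshold, for illustration only
) -> Tuple[float, float]:
    """Average AUC-ROC and F1 over classes (rows = images, columns = classes)."""
    n_classes = len(labels[0])
    aucs, f1s = [], []
    for c in range(n_classes):
        y = [row[c] for row in labels]
        s = [row[c] for row in scores]
        aucs.append(auc_roc(y, s))
        f1s.append(f1_score(y, [1 if v >= threshold else 0 for v in s]))
    return sum(aucs) / n_classes, sum(f1s) / n_classes


# Toy data (hypothetical): 4 images, 2 disease classes.
toy_labels = [[1, 0], [0, 0], [1, 1], [0, 0]]
toy_scores = [[0.9, 0.2], [0.4, 0.6], [0.3, 0.7], [0.1, 0.1]]

mean_auc, mean_f1 = macro_metrics(toy_labels, toy_scores)
print(f"macro AUC-ROC = {mean_auc:.3f}, macro F1 = {mean_f1:.3f}")
```

AUC-ROC depends only on how the scores rank positives against negatives, while F1 depends on the thresholded predictions. That difference is one reason a model on an imbalanced dataset can show a high AUC-ROC alongside a much lower F1.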

Results

Task | Dataset | Metric | Value | Model
Multi-Label Classification | ChestX-ray14 | Average AUC on 14 label | 85.266 | Improved CheXNet (DannyNet, dstrick17 et al., 2025)
Multi-Label Classification | ChestX-ray14 | Macro F1 | 0.38605 | Improved CheXNet (DannyNet, dstrick17 et al., 2025)

Related Papers

Privacy-Preserving Chest X-ray Classification in Latent Space with Homomorphically Encrypted Neural Inference (2025-06-18)
Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation (2025-06-17)
AgriPotential: A Novel Multi-Spectral and Multi-Temporal Remote Sensing Dataset for Agricultural Potentials (2025-06-13)
MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification (2025-06-12)
ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese (2025-06-11)
Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis (2025-06-05)
PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches (2025-05-30)
Efficient Text Encoders for Labor Market Analysis (2025-05-30)