TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Adversarial Fine-tuning using Generated Respiratory Sound ...

Adversarial Fine-tuning using Generated Respiratory Sound to Address Class Imbalance

June-Woo Kim, Chihyeon Yoon, Miika Toikkanen, Sangmin Bae, Ho-Young Jung

2023-11-11Sound ClassificationAudio Classification
PaperPDFCode(official)

Abstract

Deep generative models have emerged as a promising approach in the medical image domain to address data scarcity. However, their use for sequential data like respiratory sounds is less explored. In this work, we propose a straightforward approach to augment imbalanced respiratory sound data using an audio diffusion model as a conditional neural vocoder. We also demonstrate a simple yet effective adversarial fine-tuning method to align features between the synthetic and real respiratory sound samples to improve respiratory sound classification performance. Our experimental results on the ICBHI dataset demonstrate that the proposed adversarial fine-tuning is effective, while only using the conventional augmentation method shows performance degradation. Moreover, our method outperforms the baseline by 2.24% on the ICBHI Score and improves the accuracy of the minority classes up to 26.58%. For the supplementary material, we provide the code at https://github.com/kaen2891/adversarial_fine-tuning_using_generated_respiratory_sound.

Results

TaskDatasetMetricValueModel
Audio ClassificationICBHI Respiratory Sound DatabaseICBHI Score61.79AFT on Mixed-500
Audio ClassificationICBHI Respiratory Sound DatabaseSensitivity42.86AFT on Mixed-500
Audio ClassificationICBHI Respiratory Sound DatabaseSpecificity80.72AFT on Mixed-500
ClassificationICBHI Respiratory Sound DatabaseICBHI Score61.79AFT on Mixed-500
ClassificationICBHI Respiratory Sound DatabaseSensitivity42.86AFT on Mixed-500
ClassificationICBHI Respiratory Sound DatabaseSpecificity80.72AFT on Mixed-500

Related Papers

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons2025-06-24USAD: Universal Speech and Audio Representation via Distillation2025-06-23Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier2025-06-23Acoustic scattering AI for non-invasive object classifications: A case study on hair assessment2025-06-17Disentangling Dual-Encoder Masked Autoencoder for Respiratory Sound Classification2025-06-12MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification2025-06-12