TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Patch-Mix Contrastive Learning with Audio Spectrogram Tran...

Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification

Sangmin Bae, June-Woo Kim, Won-Yang Cho, Hyerim Baek, Soyoun Son, Byungjo Lee, Changwan Ha, Kyongpil Tae, Sungnyun Kim, Se-Young Yun

2023-05-23Sound ClassificationAudio ClassificationContrastive Learning
PaperPDFCode(official)

Abstract

Respiratory sound contains crucial information for the early diagnosis of fatal lung diseases. Since the COVID-19 pandemic, there has been a growing interest in contact-free medical care based on electronic stethoscopes. To this end, cutting-edge deep learning models have been developed to diagnose lung diseases; however, it is still challenging due to the scarcity of medical data. In this study, we demonstrate that the pretrained model on large-scale visual and audio datasets can be generalized to the respiratory sound classification task. In addition, we introduce a straightforward Patch-Mix augmentation, which randomly mixes patches between different samples, with Audio Spectrogram Transformer (AST). We further propose a novel and effective Patch-Mix Contrastive Learning to distinguish the mixed representations in the latent space. Our method achieves state-of-the-art performance on the ICBHI dataset, outperforming the prior leading score by an improvement of 4.08%.

Results

TaskDatasetMetricValueModel
Audio ClassificationICBHI Respiratory Sound DatabaseICBHI Score62.37AST (Patch-Mix CL)
Audio ClassificationICBHI Respiratory Sound DatabaseSensitivity43.07AST (Patch-Mix CL)
Audio ClassificationICBHI Respiratory Sound DatabaseSpecificity81.66AST (Patch-Mix CL)
Audio ClassificationICBHI Respiratory Sound DatabaseICBHI Score59.55AST (fine-tuning)
Audio ClassificationICBHI Respiratory Sound DatabaseSensitivity41.97AST (fine-tuning)
Audio ClassificationICBHI Respiratory Sound DatabaseSpecificity77.14AST (fine-tuning)
ClassificationICBHI Respiratory Sound DatabaseICBHI Score62.37AST (Patch-Mix CL)
ClassificationICBHI Respiratory Sound DatabaseSensitivity43.07AST (Patch-Mix CL)
ClassificationICBHI Respiratory Sound DatabaseSpecificity81.66AST (Patch-Mix CL)
ClassificationICBHI Respiratory Sound DatabaseICBHI Score59.55AST (fine-tuning)
ClassificationICBHI Respiratory Sound DatabaseSensitivity41.97AST (fine-tuning)
ClassificationICBHI Respiratory Sound DatabaseSpecificity77.14AST (fine-tuning)

Related Papers

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15