RespireNet: A Deep Neural Network for Accurately Detecting Abnormal Lung Sounds in Limited Data Setting

Siddhartha Gairola, Francis Tom, Nipun Kwatra, Mohit Jain

2020-10-31Audio Classification

Abstract

Auscultation of respiratory sounds is the primary tool for screening and diagnosing lung diseases. Automated analysis, coupled with digital stethoscopes, can play a crucial role in enabling tele-screening of fatal lung diseases. Deep neural networks (DNNs) have shown a lot of promise for such problems, and are an obvious choice. However, DNNs are extremely data hungry, and the largest respiratory dataset ICBHI has only 6898 breathing cycles, which is still small for training a satisfactory DNN model. In this work, RespireNet, we propose a simple CNN-based model, along with a suite of novel techniques -- device specific fine-tuning, concatenation-based augmentation, blank region clipping, and smart padding -- enabling us to efficiently use the small-sized dataset. We perform extensive evaluation on the ICBHI dataset, and improve upon the state-of-the-art results for 4-class classification by 2.2%

Results

Task	Dataset	Metric	Value	Model
Audio Classification	ICBHI Respiratory Sound Database	ICBHI Score	56.2	ResNet-34
Audio Classification	ICBHI Respiratory Sound Database	Sensitivity	40.1	ResNet-34
Audio Classification	ICBHI Respiratory Sound Database	Specificity	72.3	ResNet-34
Classification	ICBHI Respiratory Sound Database	ICBHI Score	56.2	ResNet-34
Classification	ICBHI Respiratory Sound Database	Sensitivity	40.1	ResNet-34
Classification	ICBHI Respiratory Sound Database	Specificity	72.3	ResNet-34

Related Papers

Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine2025-07-17 MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17 Neuromorphic Wireless Split Computing with Resonate-and-Fire Neurons2025-06-24 Fully Few-shot Class-incremental Audio Classification Using Multi-level Embedding Extractor and Ridge Regression Classifier2025-06-23 Adaptive Differential Denoising for Respiratory Sounds Classification2025-06-03 Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds2025-05-29 Patient-Aware Feature Alignment for Robust Lung Sound Classification:Cohesion-Separation and Global Alignment Losses2025-05-28 4,500 Seconds: Small Data Training Approaches for Deep UAV Audio Classification2025-05-21