TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Domain Knowledge Integrated CNN-xLSTM-xAtt Network with Mu...

Domain Knowledge Integrated CNN-xLSTM-xAtt Network with Multi Stream Feature Fusion for Cuffless Blood Pressure Estimation from Photoplethysmography Signals

Md Shoaib Akhter Rafi, Md Kamrul Hasan

2025-05-1308/15 2025 5Blood pressure estimationPhotoplethysmography (PPG)
PaperPDFCode

Abstract

Estimating blood pressure (BP) from photoplethysmography (PPG) signals is challenging due to signal variability and noise, as well as the complex relationship between PPG and BP, which requires sophisticated algorithms and personalization to achieve high accuracy. This paper introduces a novel deep learning (DL)-based framework, which includes a CNN-xLSTM-xAtt-based deep neural network for extracting multi-domain features, and a multi-stream feature fusion network (MSFN) with a signal enhancer for BP estimation from PPG signals. The performance of this framework is further enhanced by incorporating domain knowledge into the DL network via an ML-based loss function. The data preprocessing step incorporates a novel signal selector that accepts signals with peak height and distance variance within a specified range. To ensure robust feature learning from the peaks, a convolution-based peak enhancement (CPE) method has been employed in this work. For feature extraction, the refined signals are processed using two parallel networks, enabling complementary feature learning from both the uni-dimensional signal and image domains. The uni-dimensional path of the proposed framework, containing two parallel networks–ConvLSTM and xLSTM–facilitate both short- and long-term temporal dependencies. Concurrently, the image-based network extracts morphological and spectral features from three different image modalities, utilizing the benefits of pretrained networks. To ensure the framework focuses on the most important source in this multi-stream network, a novel multi-stream spatial- and cross-attention network (M-SCAN) is proposed. Finally, combining both type of features by a CNN-xLSTM-based multi stream fusion network (MSFN) provides an estimation of BP. During the training of the DL network, a domain knowledge-integrated dynamic quantitative embedding supervision-based tuning (D-QuEST) is also proposed as a supervision loss for the model. The performance of the proposed framework has been tested on two publicly available datasets suggesting 0.05% MSE, 1.40% MAE, and 99.36% PCC on the small and 0.06% MSE, 1.58% MAE, and 99.09% PCC on the large dataset. The proposed framework outperforms existing state-of-the-art methods by a significant margin, even with high subject diversity. This demonstrates its potential for accurate BP estimation solely from PPG signals with diverse variations.

Related Papers

CAST-Phys: Contactless Affective States Through Physiological signals Database2025-07-08Uncertainty quantification with approximate variational learning for wearable photoplethysmography prediction tasks2025-05-16A Dataset and Toolkit for Multiparameter Cardiovascular Physiology Sensing on Rings2025-05-07Contactless pulse rate assessment: Results and insights for application in driving simulator2025-05-02Generalised Label-free Artefact Cleaning for Real-time Medical Pulsatile Time Series2025-04-29PsyCounAssist: A Full-Cycle AI-Powered Psychological Counseling Assistant System2025-04-23Reliable Physiological Monitoring on the Wrist Using Generative Deep Learning to Address Poor Skin-Sensor Contact2025-04-03Exploring Reliable PPG Authentication on Smartwatches in Daily Scenarios2025-03-31