TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dreem Open Datasets: Multi-Scored Sleep Datasets to compar...

Dreem Open Datasets: Multi-Scored Sleep Datasets to compare Human and Automated sleep staging

Antoine Guillot, Fabien Sauvet, Emmanuel H. During, Valentin Thorey

2019-10-31Sleep StagingSleep Stage DetectionAutomatic Sleep Stage ClassificationMultimodal Sleep Stage Detection
PaperPDFCode(official)Code(official)

Abstract

Sleep stage classification constitutes an important element of sleep disorder diagnosis. It relies on the visual inspection of polysomnography records by trained sleep technologists. Automated approaches have been designed to alleviate this resource-intensive task. However, such approaches are usually compared to a single human scorer annotation despite an inter-rater agreement of about 85 % only. The present study introduces two publicly-available datasets, DOD-H including 25 healthy volunteers and DOD-O including 55 patients suffering from obstructive sleep apnea (OSA). Both datasets have been scored by 5 sleep technologists from different sleep centers. We developed a framework to compare automated approaches to a consensus of multiple human scorers. Using this framework, we benchmarked and compared the main literature approaches. We also developed and benchmarked a new deep learning method, SimpleSleepNet, inspired by current state-of-the-art. We demonstrated that many methods can reach human-level performance on both datasets. SimpleSleepNet achieved an F1 of 89.9 % vs 86.8 % on average for human scorers on DOD-H, and an F1 of 88.3 % vs 84.8 % on DOD-O. Our study highlights that using state-of-the-art automated sleep staging outperforms human scorers performance for healthy volunteers and patients suffering from OSA. Consideration could be made to use automated approaches in the clinical setting.

Results

TaskDatasetMetricValueModel
Sleep QualityDODHAccuracy89.9SimpleSleepNet
Sleep QualityDODHKappa84.6SimpleSleepNet
Sleep QualityDODHAccuracy89.6DeepSleepNet
Sleep QualityDODHKappa84.3DeepSleepNet
Sleep QualityDODOAccuracy88.7SimpleSleepNet
Sleep QualityDODOKappa82.3SimpleSleepNet
Sleep QualityDODOAccuracy87.5DeepSleepNet
Sleep QualityDODOKappa80.4DeepSleepNet
Sleep QualityDODOAccuracy85.5SeqSleepNet
Sleep QualityDODOKappa77.2SeqSleepNet
Sleep Stage DetectionDODHAccuracy89.9SimpleSleepNet
Sleep Stage DetectionDODHKappa84.6SimpleSleepNet
Sleep Stage DetectionDODHAccuracy89.6DeepSleepNet
Sleep Stage DetectionDODHKappa84.3DeepSleepNet
Sleep Stage DetectionDODOAccuracy88.7SimpleSleepNet
Sleep Stage DetectionDODOKappa82.3SimpleSleepNet
Sleep Stage DetectionDODOAccuracy87.5DeepSleepNet
Sleep Stage DetectionDODOKappa80.4DeepSleepNet
Sleep Stage DetectionDODOAccuracy85.5SeqSleepNet
Sleep Stage DetectionDODOKappa77.2SeqSleepNet

Related Papers

eegFloss: A Python package for refining sleep EEG recordings using machine learning models2025-07-08SLEEPYLAND: trust begins with fair evaluation of automatic sleep staging models2025-06-10From Sleep Staging to Spindle Detection: Evaluating End-to-End Automated Sleep Analysis2025-05-08PSG-MAE: Robust Multitask Sleep Event Monitoring using Multichannel PSG Reconstruction and Inter-channel Contrastive Learning2025-04-17PSDNorm: Test-Time Temporal Normalization for Deep Learning in Sleep Staging2025-03-06Vision Transformer Accelerator ASIC for Real-Time, Low-Power Sleep Staging2025-02-22Quasi Zigzag Persistence: A Topological Framework for Analyzing Time-Varying Data2025-02-22SleepGMUformer: A gated multimodal temporal neural network for sleep staging2025-02-20