QUICK REVIEW

[论文解读] Personalizing deep learning models for automatic sleep staging

Kaare B. Mikkelsen, Maarten De Vos|arXiv (Cornell University)|Jan 8, 2018

Sleep and Wakefulness Research参考文献 6被引用 30

一句话总结

本文提出一种个性化深度学习方法，通过在受试者首晚睡眠数据上微调预训练的卷积神经网络，以提升第二晚的睡眠分期分类准确率。该方法平均使性能提升约2个百分点，尤其在初始得分低于80%的受试者中效果更显著，表明结合广泛人群知识与个体适应性可增强模型的鲁棒性与泛化能力。

ABSTRACT

Despite continued advancement in machine learning algorithms and increasing availability of large data sets, there is still no universally acceptable solution for automatic sleep staging of human sleep recordings. One reason is that a skilled neurophysiologist scoring brain recordings of a sleeping person implicitly adapts his/her staging to the individual characteristics present in the brain recordings. Trying to incorporate this adaptation step in an automatic scoring algorithm, we introduce in this paper a method for personalizing a general sleep scoring model. Starting from a general convolutional neural network architecture, we allow the model to learn individual characteristics of the first night of sleep in order to quantify sleep stages of the second night. While the original neural network allows to sleep stage on a public database with a state of the art accuracy, personalizing the model further increases performance (on the order of two percentage points on average, but more for difficult subjects). This improvement is particularly present in subjects where the original algorithm did not perform well (typically subjects with accuracy less than $80\%$). Looking deeper, we find that optimal classification can be achieved when broad knowledge of sleep staging in general (at least 20 separate nights) is combined with subject-specific knowledge. We hypothesize that this method will be very valuable for improving scoring of lower quality sleep recordings, such as those from wearable devices.

研究动机与目标

为解决现有自动睡眠分期模型未能考虑个体受试者差异性的问题，而熟练的人类评分者可自然地适应这种差异。
探究迁移学习结合微调是否能提升深度学习在个体睡眠记录上的表现。
量化结合通用人群知识与个体适应性所带来的性能增益。
评估该方法在低质量或具有挑战性的睡眠记录（如可穿戴设备采集的数据）中的有效性。
探索实际临床部署场景，即仅需少量标注数据（如一次打分小睡）即可实现个性化。

提出的方法

使用包含19名受试者双晚记录的公开数据集，训练一个1D卷积神经网络，输入为EEG（Fpz-Cz）和EOG导联信号。
对每位受试者的首晚数据进行微调，使用相同的网络架构，但调整权重以适应个体特异性模式。
将输入数据扩展为每项预测包含三个连续的30秒时段（共90秒），以增强上下文感知能力。
每个时段标准化为单位标准差，并将原始标准差作为辅助输入传递至最终全连接层。
模型利用迁移学习：先在多样化数据上训练通用人群模型，再通过微调实现个体化。
在第二晚使用标准指标评估性能，并与无个性化的基线表现进行比较。

实验结果

研究问题

RQ1在单晚数据上微调预训练的深度学习模型，是否能显著提升个体受试者的自动睡眠分期准确率？
RQ2个性化是否在通用模型表现较差的受试者中（如准确率低于80%）带来更大的性能增益？
RQ3结合广泛人群知识与个体适应性在多大程度上提升了模型的鲁棒性与泛化能力？
RQ4个性化带来的性能提升是否在低质量或噪声较大的记录中（如可穿戴设备采集的数据）更为显著？
RQ5是否仅需极少标注数据（如一次打分小睡）即可实现个性化增益，从而支持实际临床部署？

主要发现

在受试者首晚睡眠数据上微调通用模型，使整个队列的平均准确率提升约2个百分点。
对于初始得分低于80%的受试者，性能增益最为显著，部分情况下提升超过2个百分点。
基线性能较低的受试者（如<80%）获得最大提升，表明个性化可纠正模型在困难病例中的局限性。
当至少20晚的广泛人群知识与个体适应性相结合时，分类性能达到最优。
对于初始得分已高于80%的受试者，微调未带来性能提升，表明此类情况下性能已接近评分者间一致性基准（约83%）。
结果表明，个性化在低质量记录（如可穿戴设备采集的数据）中尤为有价值，因为此类数据的初始模型表现通常较低。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。