QUICK REVIEW

[論文レビュー] A deep learning architecture for temporal sleep stage classification using multivariate and multimodal time series

Stanislas Chambon, Mathieu Galtier|arXiv (Cornell University)|Jul 5, 2017

EEG and Brain-Computer Interfaces参考文献 50被引用数 594

ひとこと要約

本論文は、心拍の多変量・多モーダル PSG 信号（EEG、EOG、EMG）から時系列の文脈を用いて睡眠段階を分類するエンドツーエンドの深層学習モデルを提案し、公開データで最先端の結果を達成した。

ABSTRACT

Sleep stage classification constitutes an important preliminary exam in the\ndiagnosis of sleep disorders. It is traditionally performed by a sleep expert\nwho assigns to each 30s of signal a sleep stage, based on the visual inspection\nof signals such as electroencephalograms (EEG), electrooculograms (EOG),\nelectrocardiograms (ECG) and electromyograms (EMG). We introduce here the first\ndeep learning approach for sleep stage classification that learns end-to-end\nwithout computing spectrograms or extracting hand-crafted features, that\nexploits all multivariate and multimodal Polysomnography (PSG) signals (EEG,\nEMG and EOG), and that can exploit the temporal context of each 30s window of\ndata. For each modality the first layer learns linear spatial filters that\nexploit the array of sensors to increase the signal-to-noise ratio, and the\nlast layer feeds the learnt representation to a softmax classifier. Our model\nis compared to alternative automatic approaches based on convolutional networks\nor decisions trees. Results obtained on 61 publicly available PSG records with\nup to 20 EEG channels demonstrate that our network architecture yields\nstate-of-the-art performance. Our study reveals a number of insights on the\nspatio-temporal distribution of the signal of interest: a good trade-off for\noptimal classification performance measured with balanced accuracy is to use 6\nEEG with 2 EOG (left and right) and 3 EMG chin channels. Also exploiting one\nminute of data before and after each data segment offers the strongest\nimprovement when a limited number of channels is available. As sleep experts,\nour system exploits the multivariate and multimodal nature of PSG signals in\norder to deliver state-of-the-art classification performance with a small\ncomputational cost.\n

研究の動機と目的

不均衡な多クラス予測問題として自動睡眠段階分類を動機づけ、解決する。
手作り特徴量を用いず、生の多変量 PSG 信号から学習するエンドツーエンドの深層ネットワークを提案する。
空間フィルタリングと時系列文脈を活用して頑健性と性能を向上させる。
最新手法とのベンチマークを行い、センサー構成と時系列文脈の影響を分析する。

提案手法

EEG/EOGおよびEMGそれぞれのモダリティに対して仮想チャンネルを作成する線形空間フィルタを学習するエンドツーエンドの深層アーキテクチャを導入する。
各モダリティからスペクトル特徴を抽出するため、畳み込みブロック、ReLU、最大プーリングを用いた別々のパイプラインを使用する。
モダリティ特異的特徴を結合し、ソフトマックス分類器に入力して睡眠段階の確率を出力する。
時間分布型の多変量ネットワークへ拡張し、隣接する 30 s 区間から特徴を集約して時系列文脈を取り入れる。
バランスの取れたミニバッチと Adam 最適化で学習し、頑健性のために早期停止を用いる。

実験結果

リサーチクエスチョン

RQ1生の多変量 PSG 信号からのエンドツーエンド学習は、手作り特徴量アプローチを睡眠段階分類で上回ることができるか？
RQ2空間的（多チャネル）情報の組み込みは分類性能と頑健性にどのように影響するか？
RQ3時系列文脈が分類精度とヒプノグラム遷移に与える影響は何か？
RQ4チャンネル数、追加モダリティ、および計算コストのトレードオフは何か？

主な発見

並列パイプラインでの多変量データ（EEG + EOG）とEMGを用いると、61件の公開PSGレコードで最先端の性能を達成。
より分布の良い追加の EEG チャンネルを加えると、ある点までバランス精度が向上するが、6チャネルを超えると追加モダリティが使われない限り利得は減少する。
EOGとEMGモダリティの組み込みは性能を向上させ、特に小さい EEG チャンネルセットで効果的。
時系列文脈は、空間文脈が制限されている場合に特定の段階（例：N1、N2、REM）の精度を改善するが、過度の時系列文脈は wake および N3 の性能を低下させ得る。
提案された多変量・多モーダルアーキテクチャは、ベースラインと比較して相対的に低い計算コストで高い性能を達成。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。