Skip to main content
QUICK REVIEW

[论文解读] EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Yuqi Chen, Kan Ren|arXiv (Cornell University)|Jan 11, 2024
EEG and Brain-Computer Interfaces被引用 6
一句话总结

EEGFormer 通过对1.7TB的大规模无标签EEG数据进行离散向量量化 Transformer 的预训练,以学习通用、可迁移的表示,具有可解释的码本令牌,并展示了强大的下游表现和对新生儿癫痫检测的可迁移性。

ABSTRACT

Self-supervised learning has emerged as a highly effective approach in the fields of natural language processing and computer vision. It is also applicable to brain signals such as electroencephalography (EEG) data, given the abundance of available unlabeled data that exist in a wide spectrum of real-world medical applications ranging from seizure detection to wave analysis. The existing works leveraging self-supervised learning on EEG modeling mainly focus on pretraining upon each individual dataset corresponding to a single downstream task, which cannot leverage the power of abundant data, and they may derive sub-optimal solutions with a lack of generalization. Moreover, these methods rely on end-to-end model learning which is not easy for humans to understand. In this paper, we present a novel EEG foundation model, namely EEGFormer, pretrained on large-scale compound EEG data. The pretrained model cannot only learn universal representations on EEG signals with adaptable performance on various downstream tasks but also provide interpretable outcomes of the useful patterns within the data. To validate the effectiveness of our model, we extensively evaluate it on various downstream tasks and assess the performance under different transfer settings. Furthermore, we demonstrate how the learned model exhibits transferable anomaly detection performance and provides valuable interpretability of the acquired patterns via self-supervised learning.

研究动机与目标

  • 从大规模无标签数据中激发学习通用 EEG 表示,而不是数据集特定的预训练。
  • 为 EEG 信号开发一个离散表示(向量量化)预训练框架。
  • 研究 EEG 基础模型在多种下游任务和数据集上的可迁移性。
  • 通过分析学习到的离散码本和令牌,提供可解释的洞见。
  • 在基于 TUH 的下游任务上评估性能并迁移到新生儿癫痫检测。

提出的方法

  • 将 EEG 表示为多变量时间序列并分割成补丁。
  • 使用每个通道的 Transformer 编码器对补丁进行编码,并使用位置嵌入。
  • 用向量量化码本对编码器输出进行量化,以获得离散令牌。
  • 用一个浅层 Transformer 解码器对离散令牌进行解码以重建输入(重建损失加上 VQ 惩罚)。
  • 用一个联合目标进行训练,结合重建误差和向量量化损失(码本使用与承诺项)。
  • 对预训练的编码器/解码器(以及可选的码本)进行下游任务的微调。
Figure 1: Overview of EEGFormer . Initially, multi-variate EEG signals are segmented into patches, which are then passed through a Transformer encoder. Subsequently, a vector-quantized model is employed to generate discrete indices. These indices are then fed into a shallow Transformer decoder.
Figure 1: Overview of EEGFormer . Initially, multi-variate EEG signals are segmented into patches, which are then passed through a Transformer encoder. Subsequently, a vector-quantized model is employed to generate discrete indices. These indices are then fed into a shallow Transformer decoder.

实验结果

研究问题

  • RQ1在大规模 EEG 数据上进行向量量化、基于 Transformer 的预训练,是否能产生可跨越多种 EEG 任务的通用表示?
  • RQ2与端到端方法相比,离散表示学习是否能提高 EEG 模型的可解释性?
  • RQ3EEGFormer 在 TUH 之外的数据集(如新生儿癫痫检测)上的迁移能力有多强?

主要发现

ModelPretrainMetricTUABTUARTUSLTUSZNeonate
EEGNetAUROC0.841 ± .0110.752 ± .0060.635 ± .0150.820 ± .0300.793 ± .019
EEGNetAUPRC0.832 ± .0110.433 ± .0250.351 ± .0060.470 ± .0170.499 ± .044
TCNAUROC0.841 ± .0040.687 ± .0110.545 ± .0090.817 ± .0040.731 ± .020
TCNAUPRC0.831 ± .0020.408 ± .0090.344 ± .0010.383 ± .0100.398 ± .025
EEG-GNNAUROC0.840 ± .0050.837 ± .0220.721 ± .0090.780 ± .0060.760 ± .010
EEG-GNNAUPRC0.832 ± .0040.488 ± .0150.381 ± .0040.388 ± .0230.419 ± .021
GraphS4merAUROC0.864 ± .0060.833 ± .0060.632 ± .0170.822 ± .0340.719 ± .007
GraphS4merAUPRC0.862 ± .0080.461 ± .0240.359 ± .0010.491 ± .0010.374 ± .013
BrainBERTAUROC0.853 ± .0020.753 ± .0120.588 ± .0130.814 ± .0090.734 ± .019
BrainBERTAUPRC0.846 ± .0030.350 ± .0140.352 ± .0030.386 ± .0180.398 ± .027
EEGFormer sAUROC0.862 ± .0070.847 ± .0130.683 ± .0180.875 ± .0040.842 ± .008
EEGFormer sAUPRC0.862 ± .0050.488 ± .0120.397 ± .0110.553 ± .0140.578 ± .023
EEGFormer bAUROC0.865 ± .0010.847 ± .0140.713 ± .0100.878 ± .0060.842 ± .014
EEGFormer bAUPRC0.867 ± .0020.483 ± .0260.393 ± .0030.560 ± .0100.568 ± .036
EEGFormer lAUROC0.876 ± .0030.852 ± .0040.679 ± .0130.883 ± .0050.833 ± .017
EEGFormer lAUPRC0.872 ± .0010.483 ± .0140.389 ± .0030.556 ± .0080.544 ± .026
  • EEGFormer 的变体在 TUH 下游任务的 AUROC 和 AUPRC 指标上优于若干基线。
  • 预训练的轮数对下游性能有正向影响。
  • 使用 EEGFormer 进行线性探针已经能与部分有监督基线相媲美,端到端微调可取得最佳结果。
  • 离散码本实现了可解释性,对学习到的索引进行 n-gram 分析即可在不微调的情况下定位癫痫模式。
  • 在 Neonate 和 TUSZ 数据集上,与基线相比,EEGFormer 在 AUPRC 和 AUROC 上取得显著提升。
Figure 2: Influence of pretrain epochs on two TUH corpus.
Figure 2: Influence of pretrain epochs on two TUH corpus.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。