QUICK REVIEW

[论文解读] Deep Convolutional Architectures for EEG Classification: A Comparative Study with Temporal Augmentation and Confidence-Based Voting

Aryan Patodiya, Hubert Cecotti|arXiv (Cornell University)|Feb 25, 2026

EEG and Brain-Computer Interfaces被引用 0

一句话总结

本文将二维卷积网络（2D CNN）与三维卷积网络（3D CNN）用于ERP基础的脑电分类进行对比，结果表明采用时序移位增强与测试时置信度投票的3D CNN优于二维变体，而CSP有助于2D模型。

ABSTRACT

Electroencephalography (EEG) classification plays a key role in brain-computer interface (BCI) systems, yet it remains challenging due to the low signal-to-noise ratio, temporal variability of neural responses, and limited data availability. In this paper, we present a comparative study of deep learning architectures for classifying event-related potentials (ERPs) in EEG signals. The preprocessing pipeline includes bandpass filtering, spatial filtering, and normalization. We design and compare three main pipelines: a 2D convolutional neural network (CNN) using Common Spatial Pattern (CSP), a second 2D CNN trained directly on raw data for a fair comparison, and a 3D CNN that jointly models spatiotemporal representations. To address ERP latency variations, we introduce a temporal shift augmentation strategy during training. At inference time, we employ a confidence-based test-time voting mechanism to improve prediction stability across shifted trials. An experimental evaluation on a stratified five-fold cross-validation protocol demonstrates that while CSP provides a benefit to the 2D architecture, the proposed 3D CNN significantly outperforms both 2D variants in terms of AUC and balanced accuracy. These findings highlight the effectiveness of temporal-aware architectures and augmentation strategies for robust EEG signal classification.

研究动机与目标

在低信噪比和潜伏期变异性下，推动在脑-机接口应用中对单次ERP分类的鲁棒性。
评估并比较多个深度学习管线（带CSP的2D CNN、无CSP的2D CNN，以及3D CNN）在ERP检测上的表现。
研究时序增强（时序移位）和测试时置信度投票，以缓解ERP潜伏期抖动。
在不同被试者上的性能量化，并确立在公平比较条件下3D架构是否优于2D架构。

提出的方法

对EEG数据进行带通滤波、表面拉普拉斯算子归一化；将数据切分为1000 ms的时段并下采样至64个时间点。
比较三条管线：带CSP的2D CNN（6个空间滤波器，64个时间点）、原始32通道的2D CNN，以及5D体积输入的3D CNN。
对于3D CNN，将32通道EEG映射到7x5网格，插值缺失位置，并应用时序移位增强，创建移位集合{-2,-1,0,1,2}。
使用focal loss和类别权重进行训练，应用批量归一化、GELU激活和 dropout；采用分层5折交叉验证并基于AUC进行早停。
在3D CNN测试阶段，对移位试验进行基于置信度的投票，并选择具有最高softmax置信度的预测。

实验结果

研究问题

RQ13D CNN是否比2D CNN在ERP分类中更有效地联合建模时空ERP模式？
RQ2将CSP预处理加入是否提升2D CNN相对于原始通道输入的性能？
RQ3时序移位增强和测试时置信度投票是否能提高对ERP潜伏期变异性的鲁棒性？
RQ4在公平比较条件下，3D CNN相对于表现最好的2D CNN的性能提升是多少？

主要发现

采用时序移位增强和基于置信度投票的3D CNN达到最高性能，在跨被试者的平均AUC最高可达0.994（使用GAP变体）。
2D CNN在CSP预处理下收益显著，最佳2D模型的平均AUC为0.866（有CSP） vs 0.857（无CSP）。
在3D CNN变体中，GAP版本提供最佳结果（平均AUC 0.994，std 0.003）。
最佳GAP的3D模型显著优于最佳带CSP的2D模型（AUC 0.866）及无CSP的2D模型（AUC 0.857）；配对t检验p < 0.001。
个体化评估显示每个被试的强劲表现；作者指出缺乏跨被试的泛化测试，以及未来迁移学习和可解释性增强的潜力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。