QUICK REVIEW

[论文解读] VISION-ICE: Video-based Interpretation and Spatial Identification of Arrhythmia Origins via Neural Networks in Intracardiac Echocardiography

Dorsa EPMoghaddam, Feng Gao|arXiv (Cornell University)|Feb 4, 2026

Cardiac Arrhythmias and Treatments被引用 0

一句话总结

该研究提出一个使用心内超声（ICE）视频的3D CNN框架，将心律失常起源分为三类，在未见患者上的10折患者层级交叉验证中实现平均准确率66.2%。

ABSTRACT

Contemporary high-density mapping techniques and preoperative CT/MRI remain time and resource intensive in localizing arrhythmias. AI has been validated as a clinical decision aid in providing accurate, rapid real-time analysis of echocardiographic images. Building on this, we propose an AI-enabled framework that leverages intracardiac echocardiography (ICE), a routine part of electrophysiology procedures, to guide clinicians toward areas of arrhythmogenesis and potentially reduce procedural time. Arrhythmia source localization is formulated as a three-class classification task, distinguishing normal sinus rhythm, left-sided, and right-sided arrhythmias, based on ICE video data. We developed a 3D Convolutional Neural Network trained to discriminate among the three aforementioned classes. In ten-fold cross-validation, the model achieved a mean accuracy of 66.2% when evaluated on four previously unseen patients (substantially outperforming the 33.3% random baseline). These results demonstrate the feasibility and clinical promise of using ICE videos combined with deep learning for automated arrhythmia localization. Leveraging ICE imaging could enable faster, more targeted electrophysiological interventions and reduce the procedural burden of cardiac ablation. Future work will focus on expanding the dataset to improve model robustness and generalizability across diverse patient populations.

研究动机与目标

解决在ICE引导的电生理学手术中对心律失常起源进行高效、AI辅助定位的需求。
开发一个三类分类框架，在ICE视频中区分NSR、左侧及右侧心律失常。
创建一个经过筛选的ICE视频数据集，采用标准化预处理和数据增强。
在多种ICE视图及剪辑/患者层级上进行鲁棒交叉验证以评估性能。

提出的方法

对ICE视频分类使用带灰度输入干的预训练3D ResNet-18骨干网络。
通过掩码、时序分段、裁剪/缩放、归一化等步骤将数据处理为形状为(1,32,H,W)的张量。
使用AdamW进行训练，损失函数采用带类别权重的交叉熵，结合混合精度、早停和学习率调度。
进行10折跨病人交叉验证，针对不同解剖视图训练独立模型，并在剪辑和患者层级通过多数投票融合预测。
利用Grad-CAM可视化对预测有贡献的时空区域，以实现可解释性。

实验结果

研究问题

RQ1ICE视频数据是否能够在三类设置（NSR、DIST、PROX）下基于解剖视图有效定位心律失常起源？
RQ2在聚合多种ICE视图时，3D-CNN分类器在未见患者上的性能如何？
RQ3跨视图融合是否比单视图模型在心律失常起源定位中提升鲁棒性和准确性？
RQ4在ICE视频上用Grad-CAM可视化模型决策时，其可解释性有多高？

主要发现

跨视图多数投票在所有折叠上的剪辑级平均准确率达到66.20%。
视图逐一验证的准确率随折叠和视图而异，平均TV 71.27%，MV 77.09%，LPV 73.03%，CT 62.97%（跨折叠）。
跨视图融合提升鲁棒性，在若干折叠中产生比单视点结果更高的最终患者层级决策。
Grad-CAM可视化显示模型将注意力集中在生理相关的心房结构，支持有意义的特征学习。
研究表明在微调的3D CNN、迁移学习和数据增强的基础上，ICE视频用于心律失常定位是可行的。
训练采用严格的跨病人划分以防止数据泄漏并确保对未见患者的泛化。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。