QUICK REVIEW

[论文解读] Pathway Lasso: Estimate and Select Sparse Mediation Pathways with High Dimensional Mediators

Yi Zhao, Xi Luo|arXiv (Cornell University)|Mar 24, 2016

Advanced Causal Inference Techniques参考文献 19被引用 27

一句话总结

本文提出Pathway Lasso，一种新型的正则化结构方程模型（SEM）方法，用于在高维中介变量设置中估计并选择稀疏的中介路径。通过在路径效应上引入凸松弛惩罚，该方法实现了稳定且高维的因果中介路径估计与选择，在模拟数据和fMRI数据中均优于现有方法，显著提升了选择准确率并降低了偏差。

ABSTRACT

In many scientific studies, it becomes increasingly important to delineate the causal pathways through a large number of mediators, such as genetic and brain mediators. Structural equation modeling (SEM) is a popular technique to estimate the pathway effects, commonly expressed as products of coefficients. However, it becomes unstable to fit such models with high dimensional mediators, especially for a general setting where all the mediators are causally dependent but the exact causal relationships between them are unknown. This paper proposes a sparse mediation model using a regularized SEM approach, where sparsity here means that a small number of mediators have nonzero mediation effects between a treatment and an outcome. To address the model selection challenge, we innovate by introducing a new penalty called Pathway Lasso. This penalty function is a convex relaxation of the non-convex product function, and it enables a computationally tractable optimization criterion to estimate and select many pathway effects simultaneously. We develop a fast ADMM-type algorithm to compute the model parameters, and we show that the iterative updates can be expressed in closed form. On both simulated data and a real fMRI dataset, the proposed approach yields higher pathway selection accuracy and lower estimation bias than other competing methods.

研究动机与目标

解决当中介变量数量较大或超过样本量时，估计和选择因果中介路径的挑战。
克服传统SEM在中介变量存在因果依赖且高维时的不稳定性。
开发一种方法，识别稀疏路径（即仅少数中介变量具有非零中介效应），且无需已知中介变量的排序。
通过一种新颖且计算上可行的惩罚函数，实现路径效应的同时估计与选择。
在高维fMRI数据的神经影像实际应用中，提升结果的可重复性与鲁棒性。

提出的方法

提出一种新的SEM表示方法，无需中介变量因果排序知识，即可对每个中介变量的总中介效应进行建模。
引入Pathway Lasso，即对非凸的系数乘积路径效应进行凸松弛，实现稀疏估计。
使用增广拉格朗日法结合交替方向乘子法（ADMM）求解优化问题。
推导出ADMM算法中所有参数的闭式更新，确保计算效率。
直接应用于原始数据，无需对中介变量进行变换或边际化处理。
采用10折交叉验证进行调参，并利用自助法在路径选择后评估路径的显著性。

实验结果

研究问题

RQ1能否设计一种凸惩罚，以在高维且存在因果依赖的中介变量设置中，有效估计并选择稀疏中介路径？
RQ2与边际SEM和两阶段Lasso相比，所提出的Pathway Lasso方法在路径选择准确率和估计偏差方面表现如何？
RQ3与现有方法相比，该方法在重复fMRI扫描中是否能产生更稳定、可重复的结果？
RQ4该方法能否在高维独立成分的fMRI数据中识别出具有生物学意义的脑部路径？
RQ5在高维中介分析中，Pathway Lasso在保持高选择准确率的同时，能在多大程度上降低估计偏差？

主要发现

在fMRI数据集中，Pathway Lasso的路径选择Jaccard指数比TSLasso高出70%，表明其选择准确率更优。
路径效应估计的ℓ2差异方面，Pathway Lasso略逊于TSLasso（0.260 vs. 0.258），但其在估计和选择方面均表现出更高的可重复性。
在模拟研究中，Pathway Lasso的估计偏差更低，且路径选择准确率高于对比方法。
该方法成功识别出IC 27为显著路径，其中介效应始终为负（LR运行中为−0.063，RL运行中为−0.038），中介效应占比分别为31.7%和13.4%。
自助法验证了IC 27的显著性，该路径与背外侧前额叶皮层相关，与算术处理功能有关。
路径可视化揭示了一条从刺激区域到布罗德曼区44的脑网络，其中上额 gyrus 作为关键中介节点。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。