QUICK REVIEW

[论文解读] Counterfactual Prediction with Deep Instrumental Variables Networks

Jason Hartford, Greg Lewis|arXiv (Cornell University)|Dec 30, 2016

Machine Learning and Data Classification参考文献 29被引用 33

一句话总结

本文提出 Deep IV，一种深度学习框架，通过将问题建模为两个连续的神经网络任务（第一阶段治疗预测与第二阶段反事实结果建模，并引入治疗分布损失），利用工具变量来估计观测数据中的因果效应。该方法在样本外表现更优，并支持贝叶斯与频率学派推断，在存在内生性时优于标准机器学习方法与两阶段最小二乘法（2SLS）。

ABSTRACT

We are in the middle of a remarkable rise in the use and capability of artificial intelligence. Much of this growth has been fueled by the success of deep learning architectures: models that map from observables to outputs via multiple layers of latent representations. These deep learning algorithms are effective tools for unstructured prediction, and they can be combined in AI systems to solve complex automated reasoning problems. This paper provides a recipe for combining ML algorithms to solve for causal effects in the presence of instrumental variables -- sources of treatment randomization that are conditionally independent from the response. We show that a flexible IV specification resolves into two prediction tasks that can be solved with deep neural nets: a first-stage network for treatment prediction and a second-stage network whose loss function involves integration over the conditional treatment distribution. This Deep IV framework imposes some specific structure on the stochastic gradient descent routine used for training, but it is general enough that we can take advantage of off-the-shelf ML capabilities and avoid extensive algorithm customization. We outline how to obtain out-of-sample causal validation in order to avoid over-fit. We also introduce schemes for both Bayesian and frequentist inference: the former via a novel adaptation of dropout training, and the latter via a data splitting routine.

研究动机与目标

解决由于未观测混杂因素导致治疗分配内生时，在观测数据中估计因果效应的挑战。
开发一种可扩展、灵活的深度学习框架，将计量经济学中的工具变量理论与现代机器学习相结合。
为反事实预测实现样本外因果验证与不确定性量化（贝叶斯与频率学派）。
证明深度神经网络可在不施加强参数假设的前提下，有效建模 IV 框架下的非线性、异质性治疗效应。

提出的方法

该方法将因果推断问题分解为两个阶段：第一阶段使用深度神经网络，基于工具变量与协变量预测治疗。
第二阶段网络将结果建模为治疗与协变量的函数，其损失函数通过积分条件治疗分布以校正内生性。
通过调整随机梯度下降以尊重 IV 模型的结构约束，实现两个网络的联合训练。
通过一种新型的 dropout 适应方法实现贝叶斯推断，其中反向 dropout 概率作为后验不确定性的精度参数。
通过数据分割流程实现频率学派推断，从而在第二阶段预测上实现条件推断。
该框架支持使用深度神经网络对治疗效应进行灵活、非参数化建模，避免线性或参数化假设。

实验结果

研究问题

RQ1深度神经网络能否与工具变量方法有效结合，以在复杂、高维的观测数据中估计因果效应？
RQ2如何实现样本外因果验证，以防止反事实预测模型过拟合？
RQ3在深度 IV 模型背景下，贝叶斯与频率学派推断方法的相对优势为何？
RQ4在存在内生性的情况下，Deep IV 框架相较于标准机器学习方法与 2SLS 的表现优势有多大？

主要发现

在样本外结构均方误差方面，Deep IV 显著优于标准前馈神经网络（FFNet），尤其在高内生性条件下，FFNet 无法恢复真实反事实。
与 FFNet 在高内生性下表现差不同，Deep IV 在所有内生性水平下均保持强性能，且随着训练数据增加而进一步提升。
尽管受到线性与同质性假设的限制，两阶段最小二乘法（2SLS）仍优于 FFNet，但被 Deep IV 超越，尤其在大数据集下表现更优。
使用反向 dropout 概率 c=0.99 的 Deep IV 贝叶斯不确定性估计提供了可靠的覆盖率，且大多能恢复真实反事实形状，尽管区间比频率学派方法更宽。
通过数据分割实现的频率学派推断得到的不确定性估计比贝叶斯 dropout 更紧密，但两种方法均有效，且依赖于调参选择。
在包含 100 万条观测数据的测试集中，Deep IV 的样本外均方误差为 0.026（c=0.99），表明其泛化能力强，且误差随数据量增加持续降低。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。