QUICK REVIEW

[论文解读] Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems

Christoph Reisinger, Yufei Zhang|arXiv (Cornell University)|Mar 15, 2019

Model Reduction and Neural Networks参考文献 57被引用 36

一句话总结

本文证明，使用修正的深度神经网络可以以多项式（而非指数）复杂度，在高维、僵硬受控SDEs的零和博弈中逼近非光滑的值函数，有效地克服维数灾难。它还将这些结果与相关Kolmogorov偏微分方程的黏性解联系起来。

ABSTRACT

In this paper, we establish that for a wide class of controlled stochastic differential equations (SDEs) with stiff coefficients, the value functions of corresponding zero-sum games can be represented by a deep artificial neural network (DNN), whose complexity grows at most polynomially in both the dimension of the state equation and the reciprocal of the required accuracy. Such nonlinear stiff systems may arise, for example, from Galerkin approximations of controlled stochastic partial differential equations (SPDEs), or controlled PDEs with uncertain initial conditions and source terms. This implies that DNNs can break the curse of dimensionality in numerical approximations and optimal control of PDEs and SPDEs. The main ingredient of our proof is to construct a suitable discrete-time system to effectively approximate the evolution of the underlying stochastic dynamics. Similar ideas can also be applied to obtain expression rates of DNNs for value functions induced by stiff systems with regime switching coefficients and driven by general Lévy noise.

研究动机与目标

为来自SPDE/PDE离散化的高维、僵硬SDEs 的值函数逼近问题提供动机与形式化表述。
证明深度神经网络可以在维度和精度方面以多项式复杂度逼近这些值函数。
开发一个离散时间动力学框架以及一个两步的终端成本近似，以实现网络构建。
将结果推广到受控SDEs，并讨论与相关PDE黏性解的关系及含义。

提出的方法

构建一个合适的离散时间动力系统来逼近随机动力学的演变。
对终端成本进行两步近似，以处理二次增长的成本。
使用部分隐式Euler离散化来控制僵硬SDEs中的维度相关误差。
构建修正神经网络（ReLU）来表示离散时间动力学和成本泛函。
证明所得神经网络的复杂度在维度 d 和逆精度 ε 下多项式增长。
将该框架应用于无控制和有控制的SDEs，并将发现与Kolmogorov向后PDE 的黏性解联系起来。

实验结果

研究问题

RQ1深度神经网络能否在状态维度和精度的多项式复杂度内表示高维、僵硬SDEs 的值函数？
RQ2在SDE系数的单调性和正则性条件下，神经网络是否能够克服这些值函数的维数灾难？
RQ3如何通过离散时间（部分隐式）离散化来构建近似演变僵硬SDEs 的神经网络？
RQ4近似结果是否扩展到Kolmogorov向后PDE，以及具有等级切换或Lévy噪声扩展的受控SDE？

主要发现

存在一族 DNNs {ψ_{ε,d}}，其复杂度 ≤ c d^c ε^{-c}，在 L^2(ν_d) 下以精度 ε 逼近 v_d。
对 v_d 的近似误差通过相对于有限矩分布 ν_d 的 L^2 范数来度量，且网络达到目标 ε。
结果在对漂移和扩散系数的单调性条件下成立，该条件允许对解的正则性和离散化误差进行多项式-in-d 控制。
一条推论表明，具有僵硬系数的 Kolmogorov 向后PDE 的黏性解可以被具有多项式复杂度的 DNNs 近似。
该框架容纳时变非线性系数，并允许维度相关的 Lipschitz 常数，扩展了之前的仿射系数结果。
该方法使从 SPDEs 的Galerkin 近似及其他僵硬系统得到的值函数克服维数灾难成为可能。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。