QUICK REVIEW

[论文解读] High-dimensional MCMC with a standard splitting scheme for the underdamped Langevin

Pierre Monmarché|arXiv (Cornell University)|Jul 10, 2020

Markov Chains and Monte Carlo Methods被引用 3

一句话总结

本文提出了一种基于非阻尼朗之万扩散的高维马尔可夫链蒙特卡洛（MCMC）采样器，采用标准二阶分裂积分器，每轮迭代仅需一次梯度计算。该方法建立了与维度无关的 Wasserstein 距离收缩性质，并为总变差距离和 Wasserstein 距离提供了非渐近收敛速率，其效率边界在不同光滑性假设下分别表现为 $\sqrt{d}/\varepsilon$、$\sqrt{d/\varepsilon}$ 和 $d^{1/4}/\sqrt{\varepsilon}$，与 HMC 和动力朗之万方法的已知性能表现一致。

ABSTRACT

The efficiency of a markov sampler based on the underdamped Langevin diffusion is studied for high dimensionial targets with convex and smooth potentials. We consider a classical second-order integrator which requires only one gradient computation per iteration. Contrary to previous works on similar samplers, a dimension-free contraction of Wasserstein distances and convergence rate for the total variance distance are proved for the discrete time chain itself. Non-asymptotic Wasserstein and total variation efficiency bounds and concentration inequalities are obtained for both the Metropolis adjusted and unadjusted chains. In terms of the dimension $d$ and the desired accuracy $\varepsilon$, the Wasserstein efficiency bounds are of order $\sqrt d / \varepsilon$ in the general case, $\sqrt{d/\varepsilon}$ if the Hessian of the potential is Lipschitz, and $d^{1/4}/\sqrt\varepsilon$ in the case of a separable target, in accordance with known results for other kinetic Langevin or HMC schemes.

研究动机与目标

开发一种用于对数凹、光滑且凸目标的高效高维 MCMC 采样器。
分析非阻尼朗之万扩散在高维下标准二阶分裂格式的收敛性质。
为未调整链和马尔可夫链调整链的 Wasserstein 距离与总变差距离建立非渐近界。
推导出在维度 $d$ 和精度 $\varepsilon$ 下最优缩放的效率边界，与 HMC 和动力朗之万方法的已知结果一致。

提出的方法

采用二阶分裂积分器对非阻尼朗之万 SDE 进行离散化，每轮迭代仅需一次梯度评估。
该方法利用非阻尼 SDE 的结构，确保在高维采样中的稳定性和准确性。
证明了离散时间马尔可夫链本身具有 Wasserstein 距离的收缩性质，而不仅是在极限情况下。
为未调整链和马尔可夫链调整链的版本分别推导出非渐近浓度不等式和总变差距离界。
分析依赖于凸且光滑势函数的性质，包括利普希茨 Hessian 条件，以推导出与维度相关的效率边界。

实验结果

研究问题

RQ1在高维设置下，基于分裂格式的离散时间非阻尼朗之万链的收敛速率是什么？
RQ2是否能为未调整链建立与维度无关的 Wasserstein 距离收缩性质？
RQ3在不同光滑性假设下，效率边界如何随维度 $d$ 和目标精度 $\varepsilon$ 变化？
RQ4性能边界与 HMC 和其他动力朗之万采样器相比如何？

主要发现

基于分裂格式的离散时间马尔可夫链表现出与维度无关的 Wasserstein 距离收缩性质。
为未调整链和马尔可夫链调整链分别建立了总变差距离的非渐近界。
在一般情况下，Wasserstein 效率边界表现为 $\sqrt{d}/\varepsilon$；当势函数的 Hessian 为利普希茨连续时，效率边界为 $\sqrt{d/\varepsilon}$；对于可分目标，效率边界为 $d^{1/4}/\sqrt{\varepsilon}$。
这些效率边界与 HMC 和动力朗之万方法的已知最优缩放一致，证实了该方法的竞争力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。