QUICK REVIEW

[论文解读] An introduction to sampling via measure transport

Youssef Marzouk, Tarek Moselhy|arXiv (Cornell University)|Feb 16, 2016

Markov Chains and Monte Carlo Methods参考文献 80被引用 48

一句话总结

本文提出一种用于采样的测度传输方法，通过构建从简单参考测度（例如高斯分布）到复杂目标分布的确定性、可逆映射。通过求解变分问题来学习这些映射，该方法实现了从目标分布中高效、独立、无权重的采样，并可对MCMC进行预处理或实现高斯化处理，在高维设置下通过优化和近似空间扩展实现误差量化与自适应优化。

ABSTRACT

We present the fundamentals of a measure transport approach to sampling. The idea is to construct a deterministic coupling---i.e., a transport map---between a complex "target" probability measure of interest and a simpler reference measure. Given a transport map, one can generate arbitrarily many independent and unweighted samples from the target simply by pushing forward reference samples through the map. We consider two different and complementary scenarios: first, when only evaluations of the unnormalized target density are available, and second, when the target distribution is known only through a finite collection of samples. We show that in both settings the desired transports can be characterized as the solutions of variational problems. We then address practical issues associated with the optimization--based construction of transports: choosing finite-dimensional parameterizations of the map, enforcing monotonicity, quantifying the error of approximate transports, and refining approximate transports by enriching the corresponding approximation spaces. Approximate transports can also be used to "Gaussianize" complex distributions and thus precondition conventional asymptotically exact sampling schemes. We place the measure transport approach in broader context, describing connections with other optimization--based samplers, with inference and density estimation schemes using optimal transport, and with alternative transformation--based approaches to simulation. We also sketch current work aimed at the construction of transport maps in high dimensions, exploiting essential features of the target distribution (e.g., conditional independence, low-rank structure). The approaches and algorithms presented here have direct applications to Bayesian computation and to broader problems of stochastic simulation.

研究动机与目标

开发一种框架，用于从复杂、高维的概率分布中生成独立、无权重的样本，而无需依赖马尔可夫链蒙特卡洛（MCMC）或重要性采样。
解决在贝叶斯推断和不确定性量化中目标密度计算成本高或不可计算的挑战。
构建计算高效且能利用低维结构（如条件独立性或低秩依赖）的传输映射。
提供一种系统化的方法，通过近似空间的自适应丰富化来量化和减少近似传输映射中的误差。
使传输映射可作为传统采样方法（如MCMC）的预处理器，从而改善混合性和收敛性。

提出的方法

将传输映射表述为变分问题的解，以最小化参考测度与目标测度之间的差异（例如，Kullback-Leibler散度）。
使用有限维参数化方法（如多项式混沌或神经网络）构建映射，并通过Knothe-Rosenblatt重排或雅可比行列式约束来强制单调性。
通过基于优化的学习方法，从未归一化密度评估值或目标分布的有限样本集中确定映射参数。
在映射参数化中引入结构约束（如稀疏性、条件独立性），以降低维度并提高可扩展性。
应用一阶变分分析，指导优化过程中近似空间的自适应丰富化，从而在不完全重新训练的情况下提高精度。
利用学习到的映射将参考样本转换为目标样本，或通过三角结构生成条件样本。

实验结果

研究问题

RQ1能否构建一个确定性、可逆的映射，将来自简单参考测度的样本转换为复杂目标分布的独立、无权重样本？
RQ2当仅能获得未归一化密度评估值而无法直接访问累积分布函数时，如何学习传输映射？
RQ3结构假设（如条件独立性、低秩结构）在降低高维传输映射学习复杂度方面起什么作用？
RQ4如何通过近似空间的自适应丰富化来量化和减少近似传输映射的误差？
RQ5传输映射能否用作MCMC采样器的预处理器，以改善高维、强相关后验分布中的混合性和收敛性？

主要发现

该传输映射框架通过将样本从前置测度（如标准高斯分布）前推，实现了从复杂目标分布中生成独立、无权重样本。
当仅能获得未归一化密度时，可通过求解最小化前推测度与目标测度之间Kullback-Leibler散度的变分问题来学习传输映射。
当仅能获得目标分布的样本时，可通过对偶变分公式学习传输映射，从而实现从经验数据中进行密度估计和传输映射构建。
该框架通过映射中的三角结构支持高效的条件采样，可直接采样边缘分布和条件分布。
近似传输映射可用作MCMC算法的预处理器，在高维、相关性强的后验分布中显著改善混合性和收敛速度。
通过一阶变分分析指导的近似空间自适应丰富化，可在不完全重新优化的情况下系统性地减少误差并提高传输映射的精度。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。