QUICK REVIEW

[论文解读] On Maximum Likelihood Training of Score-Based Generative Models.

Conor Durkan, Yang Song|arXiv (Cornell University)|Jan 22, 2021

Generative Adversarial Networks and Image Synthesis参考文献 61被引用 5

一句话总结

本文证明，通过连续混合得分匹配损失的得分生成建模在特定混合权重方案下，数学上等价于最大似然训练。关键贡献是一种有原则的加权方法，可仅使用得分函数实现最大似然训练和对数似然评估，无需显式参数化数据密度。

ABSTRACT

Score-based generative modeling has recently emerged as a promising alternative to traditional likelihood-based or implicit approaches. Learning in score-based models involves first perturbing data with a continuous-time stochastic process, and then matching the time-dependent gradient of the logarithm of the noisy data density - or score function - using a continuous mixture of score matching losses. In this note, we show that such an objective is equivalent to maximum likelihood for certain choices of mixture weighting. This connection provides a principled way to weight the objective function, and justifies its use for comparing different score-based generative models. Taken together with previous work, our result reveals that both maximum likelihood training and test-time log-likelihood evaluation can be achieved through parameterization of the score function alone, without the need to explicitly parameterize a density function.

研究动机与目标

建立得分生成建模与最大似然训练之间的理论联系。
确定连续混合得分匹配损失实现最大似然优化的条件。
提供一种有原则的得分匹配目标加权方法，支持不同模型之间的有效似然比较。
证明仅使用得分函数即可完成训练和推理时的对数似然评估，无需显式密度参数化。

提出的方法

本文将时变得分匹配目标分析为时间上的连续损失混合。
推导出在时间变量的特定加权函数下，混合目标与最大似然之间的等价性。
分析利用了福克-普朗克方程，以及得分函数与数据密度对数梯度之间的关系。
该方法表明，实现最大似然的最优加权由扰动过程方差的时间导数决定。
证明仅使用得分函数即可计算训练目标和推理时的对数似然。

实验结果

研究问题

RQ1在何种条件下，连续混合得分匹配损失等价于最大似然训练？
RQ2何种最优加权方案可使得分匹配目标确保与最大似然的等价性？
RQ3推理时的对数似然评估是否可无需显式参数化数据密度而完成？
RQ4得分函数参数化如何实现最大似然框架下的训练与评估？

主要发现

当混合权重按扰动方差的时间导数选择时，连续混合得分匹配损失与最大似然训练等价。
该等价性为目标加权提供了有原则的基础，支持不同得分模型之间的公平比较。
仅使用得分函数即可计算训练目标和推理时的对数似然，无需显式密度估计。
该结果统一了得分建模与最大似然，表明通过得分函数参数化可实现基于似然的训练与评估。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。