QUICK REVIEW

[论文解读] Learning Neural Random Fields with Inclusive Auxiliary Generators

Yunfu Song, Zhijian Ou|arXiv (Cornell University)|Sep 27, 2018

Generative Adversarial Networks and Image Synthesis参考文献 79被引用 26

一句话总结

本文提出 inclusive-NRF，一种新颖的方法，通过最小化包含发散的辅助生成器和随机梯度采样，学习用于连续数据的神经随机场。该方法实现了高质量的图像生成、半监督分类和异常检测，同时提供直接的未归一化密度估计——相较于 GAN，其关键优势在于结合了强大的生成性能与显式的似然评估。

ABSTRACT

Neural random fields (NRFs), which are defined by using neural networks to implement potential functions in undirected models (sometimes known as energy-based models), provide an interesting family of model spaces for machine learning, besides various directed models such as generative adversarial networks (GANs). In this paper we propose a new approach, the inclusive-NRF approach, to learning NRFs for continuous data (e.g. images), by developing inclusive-divergence minimized auxiliary generators and stochastic gradient sampling. As demonstrations of how the new approach can be flexibly and effectively used, specific inclusive-NRF models are developed and thoroughly evaluated for a number of tasks - unsupervised/supervised image generation, semi-supervised classification and anomaly detection. The proposed models consistently achieve strong experimental results in all these tasks compared to state-of-the-art methods. Remarkably, in addition to superior sample generation, one fundamental additional benefit of our inclusive-NRF approach is that, unlike GANs, it directly provides (unnormalized) density estimate for sample evaluation. With these contributions and results, this paper significantly advances the learning and applications of undirected models to a new level, both theoretically and empirically, which have never been obtained before.

研究动机与目标

为解决现有无向模型在学习连续数据时的局限性，特别是缺乏高效密度估计的问题。
开发一种灵活且高效的神经随机场训练框架，支持样本的生成与评估。
通过将未归一化密度估计集成到能量模型中，克服 GAN 的根本缺陷——缺乏密度估计。
在多种任务中验证所提方法的有效性，包括无监督生成、半监督学习和异常检测。

提出的方法

该方法采用包含发散最小化来训练辅助生成器，以引导神经随机场的学习。
使用随机梯度采样以高效优化能量模型参数。
使用神经网络参数化无向模型中的势函数，为连续数据构建灵活的能量函数。
通过统一目标函数，将生成器与能量模型的训练整合，以最小化包含发散。
该模型可对任意生成样本直接提供未归一化密度估计，支持超越生成质量的模型评估。

实验结果

研究问题

RQ1能否设计一种神经随机场框架，以同时支持高保真样本生成和连续数据的显式密度估计？
RQ2与标准 GAN 式目标相比，包含发散最小化在无向模型中如何提升训练稳定性和性能？
RQ3所提方法在图像生成、半监督分类和异常检测等多样化任务中的泛化能力如何？
RQ4在实际应用中，未归一化密度估计的引入是否为 GAN 提供了实际优势？

主要发现

inclusive-NRF 方法在所有评估任务中均达到最先进性能，包括无监督和有监督的图像生成。
在半监督图像分类任务中，该模型始终优于现有方法，展现出从有限标注数据中强大泛化能力。
在异常检测中，由于能够估计未归一化密度，模型能有效识别分布外样本。
与 GAN 不同，所提方法可对任意样本直接提供未归一化密度估计，从而实现超越定性生成的模型评估与选择。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。