QUICK REVIEW

[论文解读] Why Normalizing Flows Fail to Detect Out-of-Distribution Data

Polina Kirichenko, Pavel Izmailov|arXiv (Cornell University)|Jun 15, 2020

Generative Adversarial Networks and Image Synthesis参考文献 6被引用 31

一句话总结

本文分析了为什么 normalizing flows 常把 OOD 数据误判为分布内数据，并显示它们的归纳偏向使其依赖局部像素相关性而非语义结构；它演示了改进 OOD 检测的架构变动，并强调使用语义特征的好处。

ABSTRACT

Detecting out-of-distribution (OOD) data is crucial for robust machine learning systems. Normalizing flows are flexible deep generative models that often surprisingly fail to distinguish between in- and out-of-distribution data: a flow trained on pictures of clothing assigns higher likelihood to handwritten digits. We investigate why normalizing flows perform poorly for OOD detection. We demonstrate that flows learn local pixel correlations and generic image-to-latent-space transformations which are not specific to the target image dataset. We show that by modifying the architecture of flow coupling layers we can bias the flow towards learning the semantic structure of the target data, improving OOD detection. Our investigation reveals that properties that enable flows to generate high-fidelity images can have a detrimental effect on OOD detection.

研究动机与目标

识别为何 normalizing flows 在尽管具有精确似然度下仍无法检测到 OOD 数据。
分析耦合层流在 OOD 检测中造成阻碍的归纳偏差。
可视化潜在表示，以理解 flows 捕捉了哪些语义信息。
提出耦合层和掩蔽策略的体系结构调整，使 flows 倾向于语义结构。
评估具有语义丰富嵌入的对 OOD 检测的影响，与 flows 相关。

提出的方法

分析 RealNVP/基于 flow 的模型在分布内外数据集上的对数似然行为（例如 ImageNet、CelebA、SVHN）。
可视化中间耦合层激活以及学习到的 s（scale）和 t（shift）参数，以理解局部与语义表示。
研究耦合层机制（局部像素相关性和共适应）如何为分布内外数据驱动高似然度。
尝试掩蔽策略（棋盘格、水平、循环掩膜）以评估对 OOD 检测的影响。
在 st-networks 中引入瓶颈以限制容量并降低对局部像素相关性的依赖。
在高层语义嵌入而非原始像素上训练的 flows 展示了更好的 OOD 检测性能。

实验结果

研究问题

RQ1哪些在 normalizing flows 中的归纳偏差导致 OOD 检测效果不佳？
RQ2局部像素相关性和耦合层共适应是否使 flows 能对 OOD 数据分配高似然度？
RQ3对耦合层或掩蔽策略进行架构变动能否将 flows 转向语义表示以改善 OOD 检测？
RQ4在高层语义嵌入上训练是否比原始像素训练能更好地提升 OOD 检测？

主要发现

Flows 基于局部图形结构学习潜在表示，而非语义内容，从而削弱语义 OOD 检测。
耦合层能够在分布内外数据上准确预测被掩蔽的像素，从而为多样化的结构化图像驱动高似然度。
改变掩蔽策略或在 st-networks 中引入瓶颈，减少耦合层的共适应性，并提升分布内数据的 OOD 排名。
在具有语义内容的图像嵌入上训练 flows 相较于原始像素训练显著提升 OOD 检测。
使用高层特征（嵌入）而非原始像素数据时，OOD 检测提升，在分布内与 OOD 数据集之间实现更清晰的分离。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。