QUICK REVIEW

[论文解读] Generative Classifiers as a Basis for Trustworthy Computer Vision.

Radek Mackowiak, Lynton Ardizzone|arXiv (Cornell University)|Jul 29, 2020

Adversarial Robustness in Machine Learning参考文献 18被引用 5

一句话总结

本文提出一种基于可逆神经网络的生成分类器（GC）架构，在ImageNet上实现了76.2%的top-1准确率，展示了其在保持内在可解释性和鲁棒性的同时具备具有竞争力的性能。该方法通过生成建模提升可信度，无需复杂的后处理修改，从而实现可信的计算机视觉。

ABSTRACT

With the maturing of deep learning systems, trustworthiness is becoming increasingly important for model assessment. We understand trustworthiness as the combination of explainability and robustness. Generative classifiers (GCs) are a promising class of models that are said to naturally accomplish these qualities. However, this has mostly been demonstrated on simple datasets such as MNIST, SVHN and CIFAR in the past. In this work, we firstly develop an architecture and training scheme that allows for GCs to be trained on the ImageNet classification task, a more relevant level of complexity for practical computer vision. The resulting models use an invertible neural network architecture and achieve a competetive ImageNet top-1 accuracy of up to 76.2%. Secondly, we show the large potential of GCs for trustworthiness. Explainability and some aspects of robustness are vastly improved compared to standard feed-forward models, even when the GCs are just applied naively. While not all trustworthiness problems are solved completely, we argue from our observations that GCs are an extremely promising basis for further algorithms and modifications, as have been developed in the past for feedforward models to increase their trustworthiness. We release our trained model for download in the hope that it serves as a starting point for various other generative classification tasks in much the same way as pretrained ResNet models do for discriminative classification.

研究动机与目标

开发一种可扩展的生成分类器架构，能够在ImageNet这一高复杂度数据集上进行训练，此前GC在更简单的基准测试中受限。
评估生成分类器是否在真实视觉场景中天然提升可信度，相比标准前馈模型，其可解释性和鲁棒性是否得到增强。
建立一个实用的、预训练的生成模型，为未来研究提供支持，类似于判别学习中的预训练ResNets。
探究是否在无需额外工程优化的情况下，直接应用GC即可实现显著的可信度提升。

提出的方法

设计一种基于可逆神经网络的新架构，以实现在ImageNet上端到端训练生成分类器。
使用似然最大化目标函数进行模型训练，以学习数据与类别分布的联合分布，从而支持生成与分类双重功能。
利用网络的可逆性，实现对输入图像的精确似然计算和精确后验推断。
应用标准的数据增强和优化技术（如权重衰减、标签平滑）以稳定训练过程并提升准确率。
利用学习到的生成模型生成类别条件样本，并通过可逆变换分析特征表示。
通过不确定性校准和特征解耦等内在属性评估可信度，不依赖外部可解释性工具。

实验结果

研究问题

RQ1生成分类器能否在ImageNet上成功训练，实现具有竞争力的准确率，同时保持其固有的可信度特性？
RQ2在真实世界视觉场景中，与标准前馈模型相比，生成分类器在可解释性和鲁棒性方面提升程度如何？
RQ3在无需额外算法改进的情况下，直接应用生成分类器是否已能实现显著的可信度提升？
RQ4在准确率和可信度指标方面，生成分类器与当前最先进的判别模型相比表现如何？

主要发现

所提出的生成分类器在ImageNet上实现了76.2%的top-1准确率，证明GC能够扩展至复杂的真实世界视觉任务。
可解释性显著提升，得益于模型生成类别条件样本以及将输入反演为潜在编码的能力，从而实现直观的解释。
鲁棒性得到增强，与标准前馈网络相比，该模型表现出更好的不确定性校准能力以及对对抗性扰动的更强抵抗力。
即使未引入额外可信度技术，生成分类器在内在可信度指标上仍优于标准模型，表明其具有内在优势。
训练好的模型已公开发布，可作为未来可信计算机视觉研究的强有力基线，类似于预训练ResNets。
结果表明，生成分类器是未来可信人工智能研究的极具前景的基础，可能减少对事后可解释性工具的依赖。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。