QUICK REVIEW

[论文解读] Metric Learning for Novelty and Anomaly Detection

Marc Masana, Idoia Ruiz|arXiv (Cornell University)|Aug 16, 2018

Anomaly Detection Techniques and Applications参考文献 23被引用 51

一句话总结

本文提出一种基于度量学习的方法用于分布外检测，区分新颖性与异常，并在包括交通标志识别在内的若干基准测试中，与基于交叉熵的方法相比取得了具有竞争力的结果。

ABSTRACT

When neural networks process images which do not resemble the distribution seen during training, so called out-of-distribution images, they often make wrong predictions, and do so too confidently. The capability to detect out-of-distribution images is therefore crucial for many real-world applications. We divide out-of-distribution detection between novelty detection ---images of classes which are not in the training set but are related to those---, and anomaly detection ---images with classes which are unrelated to the training set. By related we mean they contain the same type of objects, like digits in MNIST and SVHN. Most existing work has focused on anomaly detection, and has addressed this problem considering networks trained with the cross-entropy loss. Differently from them, we propose to use metric learning which does not have the drawback of the softmax layer (inherent to cross-entropy methods), which forces the network to divide its prediction power over the learned classes. We perform extensive experiments and evaluate both novelty and anomaly detection, even in a relevant application such as traffic sign recognition, obtaining comparable or better results than previous works.

研究动机与目标

在神经网络中检测分布外输入的重要性成为研究动机。
在分布外检测中，将新颖性（相关未见类别）与异常（无关数据）区分开来。
提出一种避免软最大（softmax）信心缺陷的度量学习框架。
证明基于嵌入的方法能够获得强力的新颖性和异常检测结果。
将该方法应用于现实世界的交通标志识别，以验证实际性能。

提出的方法

引入一个嵌入，使欧氏距离衡量分布外概率。
使用度量学习损失（对比损失）将同类样本拉近、将不同类样本推开。
扩展损失以加入分布外挖掘（ODM），在训练中包含已见/未见的分布外数据。
使用单分支网络高效地挖掘嵌入对，而无需孪生网络结构。
将度量学习（ML）和分布外挖掘（ODM）与基于交叉熵的方法基线以及最新方法（CC-AG、ODIN）进行比较。

实验结果

研究问题

RQ1度量学习嵌入是否能比基于 softmax 的分类器更好地提高新颖性和异常的检测？
RQ2新颖性和异常的分布外样本在嵌入空间中有何差异，ODM 如何影响这种分离？
RQ3在训练中加入已见的分布外数据是否能提高对新颖性和异常的检测？
RQ4所提出的 ODM 方法是否在标准基准（SVHN、CIFAR-10）和现实任务（交通标志）中均有效？

主要发现

度量学习获得的嵌入在分布内样本按类别聚类，分布外样本则远离这些聚类。
分布外挖掘（ODM）进一步提升了分布内与分布外样本之间的分离，有助于检测。
与基于交叉熵的方法相比，度量学习在新颖性与异常处理方面表现更好，能减少未见数据被自信错误分类的情况。
在交通标志识别中，ODM 实现了具有竞争力的新颖性与异常检测，同时保持了合理的分布内分类性能。
结果表明分布外数据中的新颖性与异常之间存在连续性，异常检测相较新颖性检测更容易。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。