QUICK REVIEW

[论文解读] Natural Synthetic Anomalies for Self-Supervised Anomaly Detection and Localization

Hannah M. Schlüter, Jeremy Tan|arXiv (Cornell University)|Sep 30, 2021

Anomaly Detection Techniques and Applications被引用 48

一句话总结

论文提出 Natural Synthetic Anomalies (NSA)，一种 Poisson-blending 自监督任务，使用仅有的正常数据来训练端到端的异常检测与定位模型，在 MVTec AD 上无需额外数据即可达到最先进的结果。

ABSTRACT

We introduce a simple and intuitive self-supervision task, Natural Synthetic Anomalies (NSA), for training an end-to-end model for anomaly detection and localization using only normal training data. NSA integrates Poisson image editing to seamlessly blend scaled patches of various sizes from separate images. This creates a wide range of synthetic anomalies which are more similar to natural sub-image irregularities than previous data-augmentation strategies for self-supervised anomaly detection. We evaluate the proposed method using natural and medical images. Our experiments with the MVTec AD dataset show that a model trained to localize NSA anomalies generalizes well to detecting real-world a priori unknown types of manufacturing defects. Our method achieves an overall detection AUROC of 97.2 outperforming all previous methods that learn without the use of additional datasets. Code available at https://github.com/hmsch/natural-synthetic-anomalies.

研究动机与目标

激发并设计一个简单的自监督任务，用于异常检测与定位，仅使用正常训练数据。
通过 Poisson 图像编辑创建多样化、自然风格的合成异常，以提升对真实缺陷的泛化能力。
开发一个端到端的模型，输出像素级别的异常图，而不依赖预训练的编码器。
在自然图像和医学影像上评估 NSA，以展示跨领域的适用性。

提出的方法

使用 Poisson 图像编辑，将源图像中的缩放补丁无缝混合到目标图像中，创建合成异常。
使用截断 Gamma 分布采样补丁大小，并施加对象重叠约束，确保补丁影响到对象。
提供来自混合图像与目标图像强度差的像素级标签，可选二值、连续或对数几率（logistic）目标。
使用像素级损失训练编码器–解码器网络（对于有界标签用二元交叉熵，对于无界连续标签用均方误差）。
允许每张图像有多个补丁，并使用对象/背景掩码，确保真实的异常场景。

实验结果

研究问题

RQ1NSA 是否能够生成多样、真实感强的合成异常，从而在没有外部数据的情况下提升端到端的异常检测与定位？
RQ2与此前的自监督增强方案（CutPaste、FPI、PII）相比，在检测与定位准确性方面 NSA 表现如何？
RQ3在自然图像上训练的 NSA 是否能够在医学影像任务（如胸部 X 线异常检测）中无额外数据集地泛化？

主要发现

NSA 在 MVTec AD 上实现了整体图像级 AUROC 97.2，优于 CutPaste (3-way) 2.0 并且可以与使用额外数据集的方法相媲美。
NSA（logistic）的像素级定位平均 AUROC 为 96.3，与 DRAEM 相当，且高于若干在无额外数据情况下的基线。
NSA 训练的模型对真实世界缺陷具有强泛化能力，且在某些任务上可以与 ImageNet-预训练方法竞争。
在胸部 X-ray 实验（rCXR）中，NSA 超越端到端的 FPI 和 CutPaste，并且也优于 PII，展示跨域适用性。
连续（对数）标签通常在对象定位上表现更好，而二值标签对许多类别给出稳健结果；无界连续标签可能稳定性较差。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。