QUICK REVIEW

[论文解读] LafitE: Latent Diffusion Model with Feature Editing for Unsupervised Multi-class Anomaly Detection

Haonan Yin, Guanlong Jiao|arXiv (Cornell University)|Jul 16, 2023

Anomaly Detection Techniques and Applications被引用 7

一句话总结

LafitE 使用带记忆的特征编辑策略的潜在扩散模型在无监督的多类异常检测与定位中提高重构质量并减少身份快捷方式，同时引入伪异常验证用于超参数调优。

ABSTRACT

In the context of flexible manufacturing systems that are required to produce different types and quantities of products with minimal reconfiguration, this paper addresses the problem of unsupervised multi-class anomaly detection: develop a unified model to detect anomalies from objects belonging to multiple classes when only normal data is accessible. We first explore the generative-based approach and investigate latent diffusion models for reconstruction to mitigate the notorious ``identity shortcut'' issue in auto-encoder based methods. We then introduce a feature editing strategy that modifies the input feature space of the diffusion model to further alleviate ``identity shortcuts'' and meanwhile improve the reconstruction quality of normal regions, leading to fewer false positive predictions. Moreover, we are the first who pose the problem of hyperparameter selection in unsupervised anomaly detection, and propose a solution of synthesizing anomaly data for a pseudo validation set to address this problem. Extensive experiments on benchmark datasets MVTec-AD and MPDD show that the proposed LafitE, \ie, Latent Diffusion Model with Feature Editing, outperforms state-of-art methods by a significant margin in terms of average AUROC. The hyperparamters selected via our pseudo validation set are well-matched to the real test set.

研究动机与目标

在灵活制造场景中，处理仅有正常数据且跨多类的无监督多类异常检测问题。
利用潜在扩散模型学习正常数据分布，并在潜在空间通过去噪重构异常。
引入基于记忆库的特征编辑策略，将异常输入拉回正常数据流形，降低身份快捷。
提出伪异常数据生成以形成伪验证集用于超参数调优。
在基准数据集（MVTec-AD 与 MPDD）上证明鲁棒性及最新性能。

提出的方法

通过从预训练网络进行分层多尺度特征提取，将输入图像映射到潜在特征空间。
在潜在空间上对正常数据训练潜在扩散模型（DDIM），建模正常特征分布。
使用记忆库中的正常补丁特征执行特征编辑步骤，在重构前用加权最近邻的正常补丁替换查询补丁。
从部分损坏的状态开始执行基于扩散的重构以去噪并获得重构的特征图。
基于原始特征与重构特征之间的重构误差计算像素/补丁级异常分数，随后进行图像级聚合以实现定位。
从正常样本合成异常数据，形成伪验证集以选择超参数（τ 和 K）。

实验结果

研究问题

RQ1在无监督设置下，潜在扩散模型能否更好地区分跨多类的正常与异常区域？
RQ2将记忆库中的特征编辑引入是否能提升重构质量并降低扩散式异常检测中的身份快捷？
RQ3从合成异常中派生的伪验证集是否能在无监督异常检测中有效引导超参数选择？
RQ4相较于最先进方法，LafitE 在不同数据集（MVTec-AD 与 MPDD）上的检测与定位性能如何？

主要发现

LafitE 在多类异常检测与定位方面对 MVTec-AD 和 MPDD 的平均 AUROC 达到了与多种基线相比的最新水平。
带记忆库的特征编辑在性能上始终优于纯扩散基线（LafitE w/o F.E.）。
潜在空间中的扩散模型有助于缓解身份快捷并实现对正常区域更好的重构，从而降低误检。
伪验证集中的合成异常数据有助于将超参数选择与真实测试性能对齐，支持有效的无监督调参。
消融研究显示 LafitE 相较 UniAD 等基线在各数据集上均有优势，并在类别间展现更好的一致性。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。