QUICK REVIEW

[Paper Review] Deep Nearest Neighbor Anomaly Detection

Liron Bergman, Niv Cohen|arXiv (Cornell University)|Feb 24, 2020

Anomaly Detection Techniques and Applications26 references108 citations

TL;DR

The paper shows that a simple kNN anomaly detector using Imagenet-pretrained ResNet features (DN2) outperforms state-of-the-art self-supervised and deep methods across unimodal, multimodal, unsupervised, and group anomaly tasks, with low sample complexity and no training stage.

ABSTRACT

Nearest neighbors is a successful and long-standing technique for anomaly detection. Significant progress has been recently achieved by self-supervised deep methods (e.g. RotNet). Self-supervised features however typically under-perform Imagenet pre-trained features. In this work, we investigate whether the recent progress can indeed outperform nearest-neighbor methods operating on an Imagenet pretrained feature space. The simple nearest-neighbor based-approach is experimentally shown to outperform self-supervised methods in: accuracy, few shot generalization, training time and noise robustness while making fewer assumptions on image distributions.

Motivation & Objective

Motivate a simple, strong baseline for image anomaly detection using pretrained features and kNN.
Demonstrate DN2's performance across semi-supervised, unsupervised, and group anomaly settings.
Show robustness to training impurities, small datasets, and data distributions unlike Imagenet.
Compare DN2 to self-supervised and deep learning based anomaly detectors.
Provide practical guidance on when and how to apply DN2 (and its limitations).

Proposed method

Embed all images with a pretrained feature extractor (Imagenet-pretrained ResNet).
Compute k nearest neighbor distances in feature space as anomaly scores.
For semi-supervised: classify by thresholding kNN distance to training embeddings.
For unsupervised: perform a cleaning stage by removing high-density-outlier images before re-running DN2.
For group anomaly detection: pool image features within a set by averaging and apply DN2 on the group embedding.
Discuss practical speedups via clustering training features for approximate kNN retrieval.

Experimental results

Research questions

RQ1Does kNN on pretrained image features outperform self-supervised and deep feature methods for anomaly detection?
RQ2How does DN2 perform under semi-supervised, unsupervised, and group anomaly settings?
RQ3What is the impact of network depth, number of neighbors, and data invariances on DN2 performance?
RQ4Is DN2 effective with small training sets and across diverse image domains?
RQ5Can DN2 robustly handle noisy or contaminated training data?

Key findings

DN2 often outperforms state-of-the-art anomaly detection methods across several datasets (e.g., CIFAR10, Fashion-MNIST, CIFAR100).
Using Imagenet-pretrained ResNet features yields strong locality in feature space, aiding kNN-based detection.
Deeper networks improve DN2 performance; two neighbors often suffice for optimal ROCAUC.
DN2 remains effective with very small training sets, unlike some self-supervised baselines.
A cleaning stage before unsupervised DN2 significantly mitigates performance loss due to impurities in the training data.
DN2 excels in group anomaly detection by mean-pooling group features before applying DN2; outperforms simple concatenation baselines.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.