Role of Data Augmentation in Unsupervised Anomaly Detection

08/16/2022
by   Jaemin Yoo, et al.
0

Self-supervised learning (SSL) has emerged as a promising alternative to create supervisory signals to real-world tasks, avoiding extensive cost of careful labeling. SSL is particularly attractive for unsupervised problems such as anomaly detection (AD), where labeled anomalies are costly to secure, difficult to simulate, or even nonexistent. A large catalog of augmentation functions have been used for SSL-based AD (SSAD), and recent works have observed that the type of augmentation has a significant impact on performance. Motivated by those, this work sets out to put SSAD under a larger lens and carefully investigate the role of data augmentation in AD through extensive experiments on many testbeds. Our main finding is that self-supervision acts as a yet-another model hyperparameter, and should be chosen carefully in regards to the nature of true anomalies in the data. That is, the alignment between the augmentation and the underlying anomaly-generating mechanism is the key for the success of SSAD, and in the lack thereof, SSL can even impair (!) detection performance. Moving beyond proposing another SSAD method, our study contributes to the better understanding of this growing area and lays out new directions for future research.

READ FULL TEXT

page 5

page 13

page 14

page 16

research
08/28/2023

Self-Supervision for Tackling Unsupervised Anomaly Detection: Pitfalls and Opportunities

Self-supervised learning (SSL) is a growing torrent that has recently tr...
research
04/06/2023

What makes a good data augmentation for few-shot unsupervised image anomaly detection?

Data augmentation is a promising technique for unsupervised anomaly dete...
research
12/12/2021

DeepFIB: Self-Imputation for Time Series Anomaly Detection

Time series (TS) anomaly detection (AD) plays an essential role in vario...
research
06/14/2023

SaliencyCut: Augmenting Plausible Anomalies for Open-set Fine-Grained Anomaly Detection

Open-set fine-grained anomaly detection is a challenging task that requi...
research
09/21/2022

Improving Generalizability of Graph Anomaly Detection Models via Data Augmentation

Graph anomaly detection (GAD) is a vital task since even a few anomalies...
research
08/20/2023

Unilaterally Aggregated Contrastive Learning with Hierarchical Augmentation for Anomaly Detection

Anomaly detection (AD), aiming to find samples that deviate from the tra...
research
09/21/2021

Self-supervised Representation Learning for Reliable Robotic Monitoring of Fruit Anomalies

Data augmentation can be a simple yet powerful tool for autonomous robot...

Please sign up or login with your details

Forgot password? Click here to reset