Robust Variational Autoencoder for Tabular Data with Beta Divergence

06/15/2020
by   Haleh Akrami, et al.
0

We propose a robust variational autoencoder with β divergence for tabular data (RTVAE) with mixed categorical and continuous features. Variational autoencoders (VAE) and their variations are popular frameworks for anomaly detection problems. The primary assumption is that we can learn representations for normal patterns via VAEs and any deviation from that can indicate anomalies. However, the training data itself can contain outliers. The source of outliers in training data include the data collection process itself (random noise) or a malicious attacker (data poisoning) who may target to degrade the performance of the machine learning model. In either case, these outliers can disproportionately affect the training process of VAEs and may lead to wrong conclusions about what the normal behavior is. In this work, we derive a novel form of a variational autoencoder for tabular data sets with categorical and continuous features that is robust to outliers in training data. Our results on the anomaly detection application for network traffic datasets demonstrate the effectiveness of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2019

Robust Variational Autoencoder

Machine learning methods often need a large amount of labeled training d...
research
05/05/2020

Interpreting Rate-Distortion of Variational Autoencoder and Using Model Uncertainty for Anomaly Detection

Building a scalable machine learning system for unsupervised anomaly det...
research
05/14/2021

DoS and DDoS Mitigation Using Variational Autoencoders

DoS and DDoS attacks have been growing in size and number over the last ...
research
06/09/2020

Novelty Detection via Robust Variational Autoencoding

We propose a new method for novelty detection that can tolerate nontrivi...
research
05/09/2022

ReCAB-VAE: Gumbel-Softmax Variational Inference Based on Analytic Divergence

The Gumbel-softmax distribution, or Concrete distribution, is often used...
research
06/06/2018

Universal Conditional Machine

We propose a single neural probabilistic model based on variational auto...
research
08/05/2022

Variational Autoencoders for Anomaly Detection in Respiratory Sounds

This paper proposes a weakly-supervised machine learning-based approach ...

Please sign up or login with your details

Forgot password? Click here to reset