RODD: Robust Outlier Detection in Data Cubes

03/14/2023
by   Lara Kuhlmann, et al.
0

Data cubes are multidimensional databases, often built from several separate databases, that serve as flexible basis for data analysis. Surprisingly, outlier detection on data cubes has not yet been treated extensively. In this work, we provide the first framework to evaluate robust outlier detection methods in data cubes (RODD). We introduce a novel random forest-based outlier detection approach (RODD-RF) and compare it with more traditional methods based on robust location estimators. We propose a general type of test data and examine all methods in a simulation study. Moreover, we apply ROOD-RF to real world data. The results show that RODD-RF can lead to improved outlier detection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2022

Geometry- and Accuracy-Preserving Random Forest Proximities

Random forests are considered one of the best out-of-the-box classificat...
research
07/31/2019

Are Outlier Detection Methods Resilient to Sampling?

Outlier detection is a fundamental task in data mining and has many appl...
research
07/15/2019

Robust Variational Autoencoders for Outlier Detection in Mixed-Type Data

We focus on the problem of unsupervised cell outlier detection in mixed ...
research
07/28/2016

Robust Contextual Outlier Detection: Where Context Meets Sparsity

Outlier detection is a fundamental data science task with applications r...
research
11/02/2022

Analytical method for detecting outlier evaluators

Epidemiologic and medical studies often rely on evaluators to obtain mea...
research
08/14/2023

Quantifying Outlierness of Funds from their Categories using Supervised Similarity

Mutual fund categorization has become a standard tool for the investment...
research
04/26/2021

Unsupervised Instance Selection with Low-Label, Supervised Learning for Outlier Detection

The laborious process of labeling data often bottlenecks projects that a...

Please sign up or login with your details

Forgot password? Click here to reset