Outlier detection for mixed-type data: A novel approach

08/18/2023
by   Efthymios Costa, et al.
0

Outlier detection can serve as an extremely important tool for researchers from a wide range of fields. From the sectors of banking and marketing to the social sciences and healthcare sectors, outlier detection techniques are very useful for identifying subjects that exhibit different and sometimes peculiar behaviours. When the data set available to the researcher consists of both discrete and continuous variables, outlier detection presents unprecedented challenges. In this paper we propose a novel method that detects outlying observations in settings of mixed-type data, while reducing the required user interaction which can lead to misleading findings caused by subjectivity. The methodology developed is being assessed through a series of simulations on data sets with varying characteristics and achieves very good performance levels. Our method demonstrates a high capacity for detecting the majority of outliers while minimising the number of falsely detected non-outlying observations. The ideas and techniques outlined in the paper can be used either as a pre-processing step or in tandem with other data mining and machine learning algorithms for developing novel approaches to challenging research problems.

READ FULL TEXT
research
06/19/2014

Robust Outlier Detection Technique in Data Mining: A Univariate Approach

Outliers are the points which are different from or inconsistent with th...
research
06/20/2021

Outlier Detection and Spatial Analysis Algorithms

Outlier detection is a significant area in data mining. It can be either...
research
10/18/2021

Fast and Exact Outlier Detection in Metric Spaces: A Proximity Graph-based Approach

Distance-based outlier detection is widely adopted in many fields, e.g.,...
research
03/03/2021

Detecting Outliers in High-dimensional Data with Mixed Variable Types using Conditional Gaussian Regression Models

Outlier detection has gained increasing interest in recent years, due to...
research
11/13/2018

Nonparametric geometric outlier detection

Outlier detection is a major topic in robust statistics due to the high ...
research
05/12/2022

Outlier Detection for Multi-Network Data

It has become routine in neuroscience studies to measure brain networks ...
research
05/09/2023

Spatially smoothed robust covariance estimation for local outlier detection

Most multivariate outlier detection procedures ignore the spatial depend...

Please sign up or login with your details

Forgot password? Click here to reset