An Evaluation of Classification and Outlier Detection Algorithms

05/02/2018
by   Victoria J. Hodge, et al.
0

This paper evaluates algorithms for classification and outlier detection accuracies in temporal data. We focus on algorithms that train and classify rapidly and can be used for systems that need to incorporate new data regularly. Hence, we compare the accuracy of six fast algorithms using a range of well-known time-series datasets. The analyses demonstrate that the choice of algorithm is task and data specific but that we can derive heuristics for choosing. Gradient Boosting Machines are generally best for classification but there is no single winner for outlier detection though Gradient Boosting Machines (again) and Random Forest are better. Hence, we recommend running evaluations of a number of algorithms using our heuristics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/06/2018

Outlier detection on network flow analysis

It is important to be able to detect and classify malicious network traf...
research
08/07/2020

A boosted outlier detection method based on the spectrum of the Laplacian matrix of a graph

This paper explores a new outlier detection algorithm based on the spect...
research
08/18/2021

Out-of-Distribution Detection using Outlier Detection Methods

Out-of-distribution detection (OOD) deals with anomalous input to neural...
research
12/27/2020

Effective Email Spam Detection System using Extreme Gradient Boosting

The popularity, cost-effectiveness and ease of information exchange that...
research
06/13/2023

Automating Microservices Test Failure Analysis using Kubernetes Cluster Logs

Kubernetes is a free, open-source container orchestration system for dep...
research
11/13/2018

Nonparametric geometric outlier detection

Outlier detection is a major topic in robust statistics due to the high ...

Please sign up or login with your details

Forgot password? Click here to reset