PyODDS: An End-to-end Outlier Detection System with Automated Machine Learning

03/12/2020
by   Yuening Li, et al.
0

Outlier detection is an important task for various data mining applications. Current outlier detection techniques are often manually designed for specific domains, requiring large human efforts of database setup, algorithm selection, and hyper-parameter tuning. To fill this gap, we present PyODDS, an automated end-to-end Python system for Outlier Detection with Database Support, which automatically optimizes an outlier detection pipeline for a new data source at hand. Specifically, we define the search space in the outlier detection pipeline, and produce a search strategy within the given search space. PyODDS enables end-to-end executions based on an Apache Spark backend server and a light-weight database. It also provides unified interfaces and visualizations for users with or without data science or machine learning background. In particular, we demonstrate PyODDS on several real-world datasets, with quantification analysis and visualization results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2019

PyODDS: An End-to-End Outlier Detection System

PyODDS is an end-to end Python system for outlier detection with databas...
research
06/19/2020

AutoOD: Automated Outlier Detection via Curiosity-guided Search and Self-imitation Learning

Outlier detection is an important data mining task with numerous practic...
research
02/13/2019

ATMSeer: Increasing Transparency and Controllability in Automated Machine Learning

To relieve the pain of manually selecting machine learning algorithms an...
research
08/01/2021

IPOF: An Extremely and Excitingly Simple Outlier Detection Booster via Infinite Propagation

Outlier detection is one of the most popular and continuously rising top...
research
07/28/2016

Robust Contextual Outlier Detection: Where Context Meets Sparsity

Outlier detection is a fundamental data science task with applications r...
research
05/10/2018

A Proposal for Outlier and Noise Detection in Public Officials' Affidavits

Outlier and noise detection processes are highly useful in the quality a...
research
06/02/2022

Sparx: Distributed Outlier Detection at Scale

There is no shortage of outlier detection (OD) algorithms in the literat...

Please sign up or login with your details

Forgot password? Click here to reset