Interpretable Outlier Summarization

03/11/2023
by   Yu Wang, et al.
0

Outlier detection is critical in real applications to prevent financial fraud, defend network intrusions, or detecting imminent device failures. To reduce the human effort in evaluating outlier detection results and effectively turn the outliers into actionable insights, the users often expect a system to automatically produce interpretable summarizations of subgroups of outlier detection results. Unfortunately, to date no such systems exist. To fill this gap, we propose STAIR which learns a compact set of human understandable rules to summarize and explain the anomaly detection results. Rather than use the classical decision tree algorithms to produce these rules, STAIR proposes a new optimization objective to produce a small number of rules with least complexity, hence strong interpretability, to accurately summarize the detection results. The learning algorithm of STAIR produces a rule set by iteratively splitting the large rules and is optimal in maximizing this objective in each iteration. Moreover, to effectively handle high dimensional, highly complex data sets which are hard to summarize with simple rules, we propose a localized STAIR approach, called L-STAIR. Taking data locality into consideration, it simultaneously partitions data and learns a set of localized rules for each partition. Our experimental study on many outlier benchmark datasets shows that STAIR significantly reduces the complexity of the rules required to summarize the outlier detection results, thus more amenable for humans to understand and evaluate, compared to the decision tree methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2022

Towards Interpretable Anomaly Detection via Invariant Rule Mining

In the research area of anomaly detection, novel and promising methods a...
research
01/02/2020

Explainable outlier detection through decision tree conditioning

This work describes an outlier detection procedure (named "OutlierTree")...
research
02/21/2017

Interpreting Outliers: Localized Logistic Regression for Density Ratio Estimation

We propose an inlier-based outlier detection method capable of both iden...
research
11/29/2021

Anomaly Rule Detection in Sequence Data

Analyzing sequence data usually leads to the discovery of interesting pa...
research
03/12/2018

Onion-Peeling Outlier Detection in 2-D data Sets

Outlier Detection is a critical and cardinal research task due its array...
research
12/10/2021

LUNAR: Unifying Local Outlier Detection Methods via Graph Neural Networks

Many well-established anomaly detection methods use the distance of a sa...
research
12/22/2022

Machine Learning with Probabilistic Law Discovery: A Concise Introduction

Probabilistic Law Discovery (PLD) is a logic based Machine Learning meth...

Please sign up or login with your details

Forgot password? Click here to reset