Outlier Detection from Network Data with Subnetwork Interpretation

09/30/2016
by Xuan-Hong Dang, et al.

Detecting a small number of outliers in a set of observations is challenging. The problem is harder still when the data consist of multiple network samples, where computing a network sample's degree of anomaly is generally not sufficient: explaining why the network is exceptional, in the form of a subnetwork, is equally important. In this paper, we develop a novel algorithm that addresses both problems. We treat each network sample as a potential outlier and identify the subnetworks that best discriminate it from nearby regular samples. The algorithm is formulated as a network regression combined with constraints on both network topology and L1-norm shrinkage to perform subnetwork discovery. Our method thus goes beyond subspace/subgraph discovery, and we show that it converges to a global optimum. Evaluation on various real-world network datasets demonstrates that our algorithm not only outperforms baselines in both network and high-dimensional settings, but also discovers highly relevant and interpretable local subnetworks, further enhancing our understanding of anomalous networks.
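
The abstract does not spell out the objective, so the following is only a rough sketch of the general idea of combining a regression loss with a topology-aware penalty and L1 shrinkage. It is not the paper's algorithm. Everything here is an assumption made for illustration: the shared six-node path graph, the squared loss, the graph-Laplacian smoothness term, the ISTA solver, the 0/1 labels (1 for the candidate outlier, 0 for its regular neighbours), and all names and parameter values. Because this surrogate objective is convex, proximal gradient descent approaches its global minimum.

```python
import numpy as np

def soft_threshold(v, t):
    """Proximal operator of t * ||v||_1 (element-wise shrinkage)."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def sparse_graph_regression(X, y, laplacian, l1=0.01, smooth=0.05, n_iter=500):
    """Hypothetical helper: minimise by ISTA
    (1/2n)||Xw - y||^2 + (smooth/2) w^T L w + l1 * ||w||_1."""
    n, d = X.shape
    # Step size from the largest eigenvalue of the smooth part's Hessian.
    hessian = X.T @ X / n + smooth * laplacian
    step = 1.0 / np.linalg.eigvalsh(hessian).max()
    w = np.zeros(d)
    for _ in range(n_iter):
        grad = X.T @ (X @ w - y) / n + smooth * (laplacian @ w)
        w = soft_threshold(w - step * grad, step * l1)
    return w

# Toy data (assumed): 6 nodes on a path graph, one value per node per sample.
n_nodes = 6
adjacency = np.zeros((n_nodes, n_nodes))
for i in range(n_nodes - 1):
    adjacency[i, i + 1] = adjacency[i + 1, i] = 1.0
laplacian = np.diag(adjacency.sum(axis=1)) - adjacency

rng = np.random.default_rng(0)
regular = rng.normal(0.0, 0.1, size=(20, n_nodes))   # nearby regular samples
candidate = rng.normal(0.0, 0.1, size=n_nodes)       # candidate outlier
candidate[[2, 3]] += 3.0                             # anomalous on nodes 2 and 3

X = np.vstack([regular, candidate])
y = np.concatenate([np.zeros(len(regular)), [1.0]])  # 1 marks the candidate

w = sparse_graph_regression(X, y, laplacian)
# Nodes with the largest absolute weights form the explanatory subnetwork.
top_nodes = sorted(int(i) for i in np.argsort(-np.abs(w))[:2])
print(top_nodes, np.round(w, 3))
```

In this toy setting the nodes with the largest weights are read as the subnetwork that separates the candidate from its neighbours. The L1 term keeps that set small, while the Laplacian term favours weights that vary smoothly across connected nodes.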

research
11/01/2016

Local Subspace-Based Outlier Detection using Global Neighbourhoods

Outlier detection in high-dimensional data is a challenging yet importan...
research
03/03/2021

Detecting Outliers in High-dimensional Data with Mixed Variable Types using Conditional Gaussian Regression Models

Outlier detection has gained increasing interest in recent years, due to...
research
02/16/2015

Random Subspace Learning Approach to High-Dimensional Outliers Detection

We introduce and develop a novel approach to outlier detection based on ...
research
09/28/2018

Generative Adversarial Active Learning for Unsupervised Outlier Detection

Outlier detection is an important topic in machine learning and has been...
research
11/16/2021

Automatically detecting anomalous exoplanet transits

Raw light curve data from exoplanet transits is too complex to naively a...
research
11/23/2021

Post-discovery Analysis of Anomalous Subsets

Analyzing the behaviour of a population in response to disease and inter...
research
09/15/2017

A Generic Framework for Interesting Subspace Cluster Detection in Multi-attributed Networks

Detection of interesting (e.g., coherent or anomalous) clusters has been...
