Calibrated Nonparametric Scan Statistics for Anomalous Pattern Detection in Graphs

06/26/2022
by   Chunpai Wang, et al.
0

We propose a new approach, the calibrated nonparametric scan statistic (CNSS), for more accurate detection of anomalous patterns in large-scale, real-world graphs. Scan statistics identify connected subgraphs that are interesting or unexpected through maximization of a likelihood ratio statistic; in particular, nonparametric scan statistics (NPSSs) identify subgraphs with a higher than expected proportion of individually significant nodes. However, we show that recently proposed NPSS methods are miscalibrated, failing to account for the maximization of the statistic over the multiplicity of subgraphs. This results in both reduced detection power for subtle signals, and low precision of the detected subgraph even for stronger signals. Thus we develop a new statistical approach to recalibrate NPSSs, correctly adjusting for multiple hypothesis testing and taking the underlying graph structure into account. While the recalibration, based on randomization testing, is computationally expensive, we propose both an efficient (approximate) algorithm and new, closed-form lower bounds (on the expected maximum proportion of significant nodes for subgraphs of a given size, under the null hypothesis of no anomalous patterns). These advances, along with the integration of recent core-tree decomposition methods, enable CNSS to scale to large real-world graphs, with substantial improvement in the accuracy of detected subgraphs. Extensive experiments on both semi-synthetic and real-world datasets are demonstrated to validate the effectiveness of our proposed methods, in comparison with state-of-the-art counterparts.

READ FULL TEXT

page 25

page 28

page 31

page 32

research
02/16/2018

Learning Patterns for Detection with Multiscale Scan Statistics

This paper addresses detecting anomalous patterns in images, time-series...
research
03/28/2022

Detection threshold for correlated Erdős-Rényi graphs via densest subgraphs

The problem of detecting edge correlation between two Erdős-Rényi random...
research
12/11/2013

Near-optimal Anomaly Detection in Graphs using Lovasz Extended Scan Statistic

The detection of anomalous activity in graphs is a statistical problem t...
research
10/21/2019

Sparse Networks with Core-Periphery Structure

We propose a statistical model for graphs with a core-periphery structur...
research
04/04/2018

Gaussian Process Subset Scanning for Anomalous Pattern Detection in Non-iid Data

Identifying anomalous patterns in real-world data is essential for under...
research
07/01/2014

Significant Subgraph Mining with Multiple Testing Correction

The problem of finding itemsets that are statistically significantly enr...
research
01/29/2014

A Spectral Framework for Anomalous Subgraph Detection

A wide variety of application domains are concerned with data consisting...

Please sign up or login with your details

Forgot password? Click here to reset