Efficient estimation of AUC in a sliding window

02/02/2019
by Nikolaj Tatti, et al.

In many applications, monitoring the area under the ROC curve (AUC) in a sliding window over a data stream is a natural way of detecting changes in the system. The drawback is that computing AUC in a sliding window is expensive, especially if the window is large and the data rate is high. In this paper we propose a scheme for maintaining an approximate AUC in a sliding window of length k. More specifically, we propose an algorithm that, given ϵ, estimates AUC within ϵ / 2 and can maintain this estimate in O((log k) / ϵ) time per update as the window slides. This provides a speed-up over the exact computation of AUC, which requires O(k) time per update. The speed-up becomes more significant as the size of the window increases. Our estimate is based on grouping the data points together and using these groups to calculate AUC. The grouping is designed carefully such that (i) the groups are small enough that the error stays small, (ii) the number of groups is small, so that enumerating them is not expensive, and (iii) the definition is flexible enough that we can maintain the groups efficiently. Our experimental evaluation demonstrates that the average approximation error in practice is much smaller than the approximation guarantee ϵ / 2, and that we can achieve significant speed-ups with only a modest sacrifice in accuracy.
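
To make the grouping idea concrete, here is a minimal Python sketch of estimating AUC from grouped points in a sliding window. It is not the algorithm from the paper: it assumes scores lie in [0, 1] and uses fixed-width bins as the groups, which does not give the ϵ / 2 guarantee or the O((log k) / ϵ) update time described above. The names `auc_from_groups`, `BinnedSlidingAUC`, and `n_bins` are hypothetical and chosen for illustration only.

```python
from collections import deque


def auc_from_groups(groups):
    """Estimate AUC from a list of (pos_count, neg_count) pairs.

    Groups must be ordered by increasing score. A positive/negative pair
    that falls into the same group is treated as a tie (half credit).
    Returns None when the window holds only one class.
    """
    total_pos = sum(p for p, _ in groups)
    total_neg = sum(n for _, n in groups)
    if total_pos == 0 or total_neg == 0:
        return None
    neg_below = 0      # negatives in strictly lower groups
    credit = 0.0       # correctly ordered pairs, ties counted as 0.5
    for p, n in groups:
        credit += p * (neg_below + 0.5 * n)
        neg_below += n
    return credit / (total_pos * total_neg)


class BinnedSlidingAUC:
    """Sliding window of the last k (score, label) points, grouped into
    fixed-width bins (an assumption of this sketch, not the paper's scheme)."""

    def __init__(self, k, n_bins):
        self.k = k
        self.n_bins = n_bins
        self.window = deque()
        self.pos = [0] * n_bins
        self.neg = [0] * n_bins

    def _bin(self, score):
        # Scores are assumed to lie in [0, 1]; 1.0 goes into the last bin.
        return min(int(score * self.n_bins), self.n_bins - 1)

    def _update(self, score, label, delta):
        counts = self.pos if label == 1 else self.neg
        counts[self._bin(score)] += delta

    def add(self, score, label):
        """Insert a new point and evict the oldest one if the window is full."""
        self.window.append((score, label))
        self._update(score, label, +1)
        if len(self.window) > self.k:
            old_score, old_label = self.window.popleft()
            self._update(old_score, old_label, -1)

    def auc(self):
        """Enumerate the groups once; cost depends on n_bins, not on k."""
        return auc_from_groups(list(zip(self.pos, self.neg)))
```

In this sketch each update touches one bin, and querying the estimate enumerates `n_bins` groups regardless of the window length. The estimate differs from the exact AUC by at most half the fraction of positive-negative pairs that share a bin, which is why keeping the groups small matters; fixed bins cannot guarantee that when many scores cluster together.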

research
01/23/2018

Sliding Suffix Tree

We consider a sliding window over a stream of characters from some finit...
research
10/29/2021

Improved Sliding Window Algorithms for Clustering and Coverage via Bucketing-Based Sketches

Streaming computation plays an important role in large-scale data analys...
research
04/16/2019

Almost-Smooth Histograms and Sliding-Window Graph Algorithms

We study algorithms for the sliding-window model, an important variant o...
research
12/12/2021

Maintaining AUC and H-measure over time

Measuring the performance of a classifier is a vital task in machine lea...
research
11/28/2015

Sliding-Window Optimization on an Ambiguity-Clearness Graph for Multi-object Tracking

Multi-object tracking remains challenging due to frequent occurrence of ...
research
01/01/2019

Approximation of the Cell Under Test in Sliding Window Detection Processes

Analysis of sliding window detection processes requires carefu...
research
07/06/2017

Adaptive Modular Exponentiation Methods v.s. Python's Power Function

In this paper we use Python to implement two efficient modular exponenti...
