Drift Detection in Episodic Data: Detect When Your Agent Starts Faltering

10/22/2020
by   Ido Greenberg, et al.
0

Detection of deterioration of agent performance in dynamic environments is challenging due to the non-i.i.d nature of the observed performance. We consider an episodic framework, where the objective is to detect when an agent begins to falter. We devise a hypothesis testing procedure for non-i.i.d rewards, which is optimal under certain conditions. To apply the procedure sequentially in an online manner, we also suggest a novel Bootstrap mechanism for False Alarm Rate control (BFAR). We demonstrate our procedure in problems where the rewards are not independent, nor identically-distributed, nor normally-distributed. The statistical power of the new testing procedure is shown to outperform alternative tests - often by orders of magnitude - for a variety of environment modifications (which cause deterioration in agent performance). Our detection method is entirely external to the agent, and in particular does not require model-based learning. Furthermore, it can be applied to detect changes or drifts in any episodic signal.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/26/2017

A Flexible Framework for Hypothesis Testing in High-dimensions

Hypothesis testing in the linear regression model is a fundamental stati...
research
06/25/2018

Request-and-Reverify: Hierarchical Hypothesis Testing for Concept Drift Detection with Expensive Labels

One important assumption underlying common classification models is the ...
research
10/06/2020

Sequential Changepoint Detection in Neural Networks with Checkpoints

We introduce a framework for online changepoint detection and simultaneo...
research
03/09/2022

A continuous multiple hypothesis testing framework for optimal exoplanet detection

The detection of exoplanets is hindered by the presence of complex astro...
research
10/26/2020

Dynamic Algorithms for Online Multiple Testing

We demonstrate new algorithms for online multiple testing that provably ...
research
12/31/2019

On Testing for Biases in Peer Review

We consider the issue of biases in scholarly research, specifically, in ...
research
10/09/2022

QuTE: decentralized multiple testing on sensor networks with false discovery rate control

This paper designs methods for decentralized multiple hypothesis testing...

Please sign up or login with your details

Forgot password? Click here to reset