Monitoring and Anomaly Detection Actor-Critic Based Controlled Sensing

01/03/2022
by   Geethu Joseph, et al.
0

We address the problem of monitoring a set of binary stochastic processes and generating an alert when the number of anomalies among them exceeds a threshold. For this, the decision-maker selects and probes a subset of the processes to obtain noisy estimates of their states (normal or anomalous). Based on the received observations, the decisionmaker first determines whether to declare that the number of anomalies has exceeded the threshold or to continue taking observations. When the decision is to continue, it then decides whether to collect observations at the next time instant or defer it to a later time. If it chooses to collect observations, it further determines the subset of processes to be probed. To devise this three-step sequential decision-making process, we use a Bayesian formulation wherein we learn the posterior probability on the states of the processes. Using the posterior probability, we construct a Markov decision process and solve it using deep actor-critic reinforcement learning. Via numerical experiments, we demonstrate the superior performance of our algorithm compared to the traditional model-based algorithms.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/08/2021

Scalable and Decentralized Algorithms for Anomaly Detection via Learning-Based Controlled Sensing

We address the problem of sequentially selecting and observing processes...
research
05/12/2021

A Scalable Algorithm for Anomaly Detection via Learning-Based Controlled Sensing

We address the problem of sequentially selecting and observing processes...
research
05/12/2021

Anomaly Detection via Controlled Sensing and Deep Active Inference

In this paper, we address the anomaly detection problem where the object...
research
08/28/2019

Deep Actor-Critic Reinforcement Learning for Anomaly Detection

Anomaly detection is widely applied in a variety of domains, involving f...
research
11/22/2022

Decision-making with Imaginary Opponent Models

Opponent modeling has benefited a controlled agent's decision-making by ...
research
02/20/2022

Learning to Control Partially Observed Systems with Finite Memory

We consider the reinforcement learning problem for partially observed Ma...
research
08/21/2022

Robust Tests in Online Decision-Making

Bandit algorithms are widely used in sequential decision problems to max...

Please sign up or login with your details

Forgot password? Click here to reset