Policy Design for Active Sequential Hypothesis Testing using Deep Learning

10/11/2018
by   Dhruva Kartik, et al.
0

Information theory has been very successful in obtaining performance limits for various problems such as communication, compression and hypothesis testing. Likewise, stochastic control theory provides a characterization of optimal policies for Partially Observable Markov Decision Processes (POMDPs) using dynamic programming. However, finding optimal policies for these problems is computationally hard in general and thus, heuristic solutions are employed in practice. Deep learning can be used as a tool for designing better heuristics in such problems. In this paper, the problem of active sequential hypothesis testing is considered. The goal is to design a policy that can reliably infer the true hypothesis using as few samples as possible by adaptively selecting appropriate queries. This problem can be modeled as a POMDP and bounds on its value function exist in literature. However, optimal policies have not been identified and various heuristics are used. In this paper, two new heuristics are proposed: one based on deep reinforcement learning and another based on a KL-divergence zero-sum game. These heuristics are compared with state-of-the-art solutions and it is demonstrated using numerical experiments that the proposed heuristics can achieve significantly better performance than existing methods in some scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/04/2018

Sequential Experiment Design for Hypothesis Verification

Hypothesis testing is an important problem with applications in target l...
research
03/19/2023

Active hypothesis testing in unknown environments using recurrent neural networks and model free reinforcement learning

A combination of deep reinforcement learning and supervised learning is ...
research
12/12/2019

Learning Improvement Heuristics for Solving the Travelling Salesman Problem

Recent studies in using deep learning to solve the Travelling Salesman P...
research
09/12/2017

Information Design in Crowdfunding under Thresholding Policies

In crowdfunding, an entrepreneur often has to decide how to disclose the...
research
03/07/2021

Approximation Algorithms for Active Sequential Hypothesis Testing

In the problem of active sequential hypotheses testing (ASHT), a learner...
research
11/15/2019

Fixed-horizon Active Hypothesis Testing

Two active hypothesis testing problems are formulated. In these problems...
research
04/19/2020

Sequential hypothesis testing in machine learning driven crude oil jump detection

In this paper we present a sequential hypothesis test for the detection ...

Please sign up or login with your details

Forgot password? Click here to reset