Partial Counterfactual Identification for Infinite Horizon Partially Observable Markov Decision Process

This paper investigates the problem of bounding possible output from a counterfactual query given a set of observational data. While various works of literature have described methodologies to generate efficient algorithms that provide an optimal bound for the counterfactual query, all of them assume a finite-horizon causal diagram. This paper aims to extend the previous work by modifying Q-learning algorithm to provide informative bounds of a causal query given an infinite-horizon causal diagram. Through simulations, our algorithms are proven to perform better compared to existing algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2021

Partial Counterfactual Identification from Observational and Experimental Data

This paper investigates the problem of bounding counterfactual queries f...
research
10/27/2021

Finite Horizon Q-learning: Stability, Convergence and Simulations

Q-learning is a popular reinforcement learning algorithm. This algorithm...
research
12/06/2022

Learning to Bound Counterfactual Inference in Structural Causal Models from Observational and Randomised Data

We address the problem of integrating data from multiple observational a...
research
02/14/2012

Efficient Inference in Markov Control Problems

Markov control algorithms that perform smooth, non-greedy updates of the...
research
06/05/2019

Quantum Algorithms for Solving Dynamic Programming Problems

We present quantum algorithms for solving finite-horizon and infinite-ho...
research
07/31/2023

Approximating Counterfactual Bounds while Fusing Observational, Biased and Randomised Data Sources

We address the problem of integrating data from multiple, possibly biase...
research
06/03/2011

Experiments with Infinite-Horizon, Policy-Gradient Estimation

In this paper, we present algorithms that perform gradient ascent of the...

Please sign up or login with your details

Forgot password? Click here to reset