Learning to Make Predictions In Partially Observable Environments Without a Generative Model

01/16/2014
by   Erik Talvitie, et al.
0

When faced with the problem of learning a model of a high-dimensional environment, a common approach is to limit the model to make only a restricted set of predictions, thereby simplifying the learning problem. These partial models may be directly useful for making decisions or may be combined together to form a more complete, structured model. However, in partially observable (non-Markov) environments, standard model-learning methods learn generative models, i.e. models that provide a probability distribution over all possible futures (such as POMDPs). It is not straightforward to restrict such models to make only certain predictions, and doing so does not always simplify the learning problem. In this paper we present prediction profile models: non-generative partial models for partially observable systems that make only a given set of predictions, and are therefore far simpler than generative models in some cases. We formalize the problem of learning a prediction profile model as a transformation of the original model-learning problem, and show empirically that one can learn prediction profile models that make a small set of important predictions even in systems that are too complex for standard generative models.

READ FULL TEXT
research
06/07/2023

On the Use of Generative Models in Observational Causal Analysis

The use of a hypothetical generative model was been suggested for causal...
research
10/06/2022

Content-Based Search for Deep Generative Models

The growing proliferation of pretrained generative models has made it in...
research
08/23/2023

Improving Generative Model-based Unfolding with Schrödinger Bridges

Machine learning-based unfolding has enabled unbinned and high-dimension...
research
04/22/2016

Learning a Tree-Structured Ising Model in Order to Make Predictions

We study the problem of learning a tree graphical model from samples suc...
research
05/23/2018

Dyna Planning using a Feature Based Generative Model

Dyna-style reinforcement learning is a powerful approach for problems wh...
research
04/25/2018

Generative Temporal Models with Spatial Memory for Partially Observed Environments

In model-based reinforcement learning, generative and temporal models of...
research
02/08/2019

Generating the support with extreme value losses

When optimizing against the mean loss over a distribution of predictions...

Please sign up or login with your details

Forgot password? Click here to reset