Learning Compact Models for Planning with Exogenous Processes

09/30/2019
by   Rohan Chitnis, et al.
0

We address the problem of approximate model minimization for MDPs in which the state is partitioned into endogenous and (much larger) exogenous components. An exogenous state variable is one whose dynamics are independent of the agent's actions. We formalize the mask-learning problem, in which the agent must choose a subset of exogenous state variables to reason about when planning; doing planning in such a reduced state space can often be significantly more efficient than planning in the full model. We then explore the various value functions at play within this setting, and describe conditions under which a policy for a reduced model will be optimal for the full MDP. The analysis leads us to a tractable approximate algorithm that draws upon the notion of mutual information among exogenous state variables. We validate our approach in simulated robotic manipulation domains where a robot is placed in a busy environment, in which there are many other agents also interacting with the objects. Visit http://tinyurl.com/chitnis-exogenous for a supplementary video.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2014

Extreme State Aggregation Beyond MDPs

We consider a Reinforcement Learning setup where an agent interacts with...
research
07/13/2020

Efficient Planning in Large MDPs with Weak Linear Function Approximation

Large-scale Markov decision processes (MDPs) require planning algorithms...
research
12/12/2009

Closing the Learning-Planning Loop with Predictive State Representations

A central problem in artificial intelligence is that of planning to maxi...
research
07/26/2020

CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs

Meta-planning, or learning to guide planning from experience, is a promi...
research
05/20/2020

MDPs with Unawareness in Robotics

We formalize decision-making problems in robotics and automated control ...
research
05/03/2015

Metareasoning for Planning Under Uncertainty

The conventional model for online planning under uncertainty assumes tha...
research
06/29/2020

Exploring Optimal Control With Observations at a Cost

There has been a current trend in reinforcement learning for healthcare ...

Please sign up or login with your details

Forgot password? Click here to reset