The Limits of Learning and Planning: Minimal Sufficient Information Transition Systems

12/01/2022
by   Basak Sakcak, et al.
0

In this paper, we view a policy or plan as a transition system over a space of information states that reflect a robot's or other observer's perspective based on limited sensing, memory, computation, and actuation. Regardless of whether policies are obtained by learning algorithms, planning algorithms, or human insight, we want to know the limits of feasibility for given robot hardware and tasks. Toward the quest to find the best policies, we establish in a general setting that minimal information transition systems (ITSs) exist up to reasonable equivalence assumptions, and are unique under some general conditions. We then apply the theory to generate new insights into several problems, including optimal sensor fusion/filtering, solving basic planning tasks, and finding minimal representations for feasible policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/17/2023

A Mathematical Characterization of Minimally Sufficient Robot Brains

This paper addresses the lower limits of encoding and processing the inf...
research
09/25/2018

Finding plans subject to stipulations on what information they divulge

Motivated by applications where privacy is important, we consider planni...
research
06/20/2023

The Unintended Consequences of Discount Regularization: Improving Regularization in Certainty Equivalence Reinforcement Learning

Discount regularization, using a shorter planning horizon when calculati...
research
06/26/2020

What can I do here? A Theory of Affordances in Reinforcement Learning

Reinforcement learning algorithms usually assume that all actions are al...
research
01/12/2022

Planning in Observable POMDPs in Quasipolynomial Time

Partially Observable Markov Decision Processes (POMDPs) are a natural an...
research
06/25/2021

Decomposition of transition systems into sets of synchronizing state machines

Transition systems (TS) and Petri nets (PN) are important models of comp...
research
07/23/2018

Toward a language-theoretic foundation for planning and filtering

We address problems underlying the algorithmic question of automating th...

Please sign up or login with your details

Forgot password? Click here to reset