Exploiting Model Equivalences for Solving Interactive Dynamic Influence Diagrams

01/18/2014
by   Yifeng Zeng, et al.
0

We focus on the problem of sequential decision making in partially observable environments shared with other agents of uncertain types having similar or conflicting objectives. This problem has been previously formalized by multiple frameworks one of which is the interactive dynamic influence diagram (I-DID), which generalizes the well-known influence diagram to the multiagent setting. I-DIDs are graphical models and may be used to compute the policy of an agent given its belief over the physical state and others models, which changes as the agent acts and observes in the multiagent setting. As we may expect, solving I-DIDs is computationally hard. This is predominantly due to the large space of candidate models ascribed to the other agents and its exponential growth over time. We present two methods for reducing the size of the model space and stemming its exponential growth. Both these methods involve aggregating individual models into equivalence classes. Our first method groups together behaviorally equivalent models and selects only those models for updating which will result in predictive behaviors that are distinct from others in the updated model space. The second method further compacts the model space by focusing on portions of the behavioral predictions. Specifically, we cluster actionally equivalent models that prescribe identical actions at a single time step. Exactly identifying the equivalences would require us to solve all models in the initial set. We avoid this by selectively solving some of the models, thereby introducing an approximation. We discuss the error introduced by the approximation, and empirically demonstrate the improved efficiency in solving I-DIDs due to the equivalences.

READ FULL TEXT

page 30

page 31

research
02/04/2017

Solving the Brachistochrone Problem by an Influence Diagram

Influence diagrams are a decision-theoretic extension of probabilistic g...
research
03/27/2013

A Method for Using Belief Networks as Influence Diagrams

This paper demonstrates a method for using belief-network algorithms to ...
research
09/01/2014

Team Behavior in Interactive Dynamic Influence Diagrams with Applications to Ad Hoc Teams

Planning for ad hoc teamwork is challenging because it involves agents c...
research
01/15/2014

AND/OR Multi-Valued Decision Diagrams (AOMDDs) for Graphical Models

Inspired by the recently introduced framework of AND/OR search spaces fo...
research
09/23/2019

Compiling Stochastic Constraint Programs to And-Or Decision Diagrams

Factored stochastic constraint programming (FSCP) is a formalism to repr...
research
02/21/2019

Policies for growth of influence networks in task-oriented groups: elitism and egalitarianism outperform welfarism

Communication or influence networks are probably the most controllable o...
research
02/13/2013

Constraining Influence Diagram Structure by Generative Planning: An Application to the Optimization of Oil Spill Response

This paper works through the optimization of a real world planning probl...

Please sign up or login with your details

Forgot password? Click here to reset