research ∙ 12/06/2022
Towards a more efficient computation of individual attribute and policy contribution for post-hoc explanation of cooperative multi-agent systems using Myerson values
A quantitative assessment of the global importance of an agent in a team...

research ∙ 12/18/2021
Exploiting Expert-guided Symmetry Detection in Markov Decision Processes
Offline estimation of the dynamical model of a Markov Decision Process (...

research ∙ 11/19/2021
Expert-Guided Symmetry Detection in Markov Decision Processes
Learning a Markov Decision Process (MDP) from a fixed batch of trajector...

research ∙ 05/27/2021
Exploitation vs Caution: Risk-sensitive Policies for Offline Learning
Offline model learning for planning is a branch of machine learning that...

research ∙ 10/05/2020