Inverse Reinforcement Learning with Missing Data

11/16/2019
by   Tien Mai, et al.
26

We consider the problem of recovering an expert's reward function with inverse reinforcement learning (IRL) when there are missing/incomplete state-action pairs or observations in the demonstrated trajectories. This issue of missing trajectory data or information occurs in many situations, e.g., GPS signals from vehicles moving on a road network are intermittent. In this paper, we propose a tractable approach to directly compute the log-likelihood of demonstrated trajectories with incomplete/missing data. Our algorithm is efficient in handling a large number of missing segments in the demonstrated trajectories, as it performs the training with incomplete data by solving a sequence of systems of linear equations, and the number of such systems to be solved does not depend on the number of missing segments. Empirical evaluation on a real-world dataset shows that our training algorithm outperforms other conventional techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2019

Generalized Maximum Causal Entropy for Inverse Reinforcement Learning

We consider the problem of learning from demonstrated trajectories with ...
research
12/12/2019

Improved Activity Forecasting for Generating Trajectories

An efficient inverse reinforcement learning for generating trajectories ...
research
03/28/2017

Inverse Reinforcement Learning from Incomplete Observation Data

Inverse reinforcement learning (IRL) aims to explain observed strategic ...
research
06/09/2020

Causal Discovery from Incomplete Data using An Encoder and Reinforcement Learning

Discovering causal structure among a set of variables is a fundamental p...
research
03/01/2022

Recovery of Missing Sensor Data by Reconstructing Time-varying Graph Signals

Wireless sensor networks are among the most promising technologies of th...
research
06/30/2021

MissFormer: (In-)attention-based handling of missing observations for trajectory filtering and prediction

In applications such as object tracking, time-series data inevitably car...
research
07/10/2018

Handling Incomplete Heterogeneous Data using VAEs

Variational autoencoders (VAEs), as well as other generative models, hav...

Please sign up or login with your details

Forgot password? Click here to reset