Learning Temporal Strategic Relationships using Generative Adversarial Imitation Learning

05/13/2018
by   Tharindu Fernando, et al.
0

This paper presents a novel framework for automatic learning of complex strategies in human decision making. The task that we are interested in is to better facilitate long term planning for complex, multi-step events. We observe temporal relationships at the subtask level of expert demonstrations, and determine the different strategies employed in order to successfully complete a task. To capture the relationship between the subtasks and the overall goal, we utilise two external memory modules, one for capturing dependencies within a single expert demonstration, such as the sequential relationship among different sub tasks, and a global memory module for modelling task level characteristics such as best practice employed by different humans based on their domain expertise. Furthermore, we demonstrate how the hidden state representation of the memory can be used as a reward signal to smooth the state transitions, eradicating subtle changes. We evaluate the effectiveness of the proposed model for an autonomous highway driving application, where we demonstrate its capability to learn different expert policies and outperform state-of-the-art methods. The scope in industrial applications extends to any robotics and automation application which requires learning from complex demonstrations containing series of subtasks.

READ FULL TEXT

page 7

page 8

research
03/26/2017

InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations

The goal of imitation learning is to mimic expert behavior without acces...
research
12/26/2020

Imitation Learning for High Precision Peg-in-Hole Tasks

Industrial robot manipulators are not able to match the precision and sp...
research
10/02/2018

Video Imitation GAN: Learning control policies by imitating raw videos using generative adversarial reward estimation

Natural imitation in humans usually consists of mimicking visual demonst...
research
11/19/2019

Adversarial Inverse Reinforcement Learning for Decision Making in Autonomous Driving

Generative Adversarial Imitation Learning (GAIL) is an efficient way to ...
research
04/02/2020

Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection

This paper presents a novel framework for Speech Activity Detection (SAD...
research
02/15/2023

Understanding Expertise through Demonstrations: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning

Offline inverse reinforcement learning (Offline IRL) aims to recover the...
research
01/16/2019

Memory Augmented Deep Generative models for Forecasting the Next Shot Location in Tennis

This paper presents a novel framework for predicting shot location and t...

Please sign up or login with your details

Forgot password? Click here to reset