Provable Representation Learning for Imitation Learning via Bi-level Optimization

02/24/2020
by   Sanjeev Arora, et al.
6

A common strategy in modern learning systems is to learn a representation that is useful for many tasks, a.k.a. representation learning. We study this strategy in the imitation learning setting for Markov decision processes (MDPs) where multiple experts' trajectories are available. We formulate representation learning as a bi-level optimization problem where the "outer" optimization tries to learn the joint representation and the "inner" optimization encodes the imitation learning setup and tries to learn task-specific parameters. We instantiate this framework for the imitation learning settings of behavior cloning and observation-alone. Theoretically, we show using our framework that representation learning can provide sample complexity benefits for imitation learning in both settings. We also provide proof-of-concept experiments to verify our theory.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2022

An Empirical Investigation of Representation Learning for Imitation

Imitation learning often needs a large demonstration set in order to han...
research
12/01/2022

Multi-Task Imitation Learning for Linear Dynamical Systems

We study representation learning for efficient imitation learning over l...
research
02/27/2020

Provably Efficient Third-Person Imitation from Offline Observation

Domain adaptation in imitation learning represents an essential step tow...
research
05/26/2021

Provable Representation Learning for Imitation with Contrastive Fourier Features

In imitation learning, it is common to learn a behavior policy to match ...
research
10/12/2022

Travel the Same Path: A Novel TSP Solving Strategy

In this paper, we provide a novel strategy for solving Traveling Salesma...
research
06/22/2019

Learning Belief Representations for Imitation Learning in POMDPs

We consider the problem of imitation learning from expert demonstrations...
research
01/31/2020

Domain-Adversarial and -Conditional State Space Model for Imitation Learning

State representation learning (SRL) in partially observable Markov decis...

Please sign up or login with your details

Forgot password? Click here to reset