Path Integral Networks: End-to-End Differentiable Optimal Control

06/29/2017
by   Masashi Okada, et al.
0

In this paper, we introduce Path Integral Networks (PI-Net), a recurrent network representation of the Path Integral optimal control algorithm. The network includes both system dynamics and cost models, used for optimal control based planning. PI-Net is fully differentiable, learning both dynamics and cost models end-to-end by back-propagation and stochastic gradient descent. Because of this, PI-Net can learn to plan. PI-Net has several advantages: it can generalize to unseen states thanks to planning, it can be applied to continuous control tasks, and it allows for a wide variety learning schemes, including imitation and reinforcement learning. Preliminary experiment results show that PI-Net, trained by imitation learning, can mimic control demonstrations for two simulated problems; a linear system and a pendulum swing-up problem. We also show that PI-Net is able to learn dynamics and cost models latent in the demonstrations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/31/2018

Differentiable MPC for End-to-end Planning and Control

We present foundations for using Model Predictive Control (MPC) as a dif...
research
03/20/2017

QMDP-Net: Deep Learning for Planning under Partial Observability

This paper introduces the QMDP-net, a neural network architecture for pl...
research
11/30/2021

Path Integral Sampler: a stochastic control approach for sampling

We present Path Integral Sampler (PIS), a novel algorithm to draw sample...
research
04/02/2018

Universal Planning Networks

A key challenge in complex visuomotor control is learning abstract repre...
research
06/10/2021

Differentiable Robust LQR Layers

This paper proposes a differentiable robust LQR layer for reinforcement ...
research
05/07/2020

Plan2Vec: Unsupervised Representation Learning by Latent Plans

In this paper we introduce plan2vec, an unsupervised representation lear...
research
05/13/2020

Adaptive Smoothing Path Integral Control

In Path Integral control problems a representation of an optimally contr...

Please sign up or login with your details

Forgot password? Click here to reset