On Convex Data-Driven Inverse Optimal Control for Nonlinear, Non-stationary and Stochastic Systems

06/24/2023
by Emiland Garrabe, et al.

This paper is concerned with a finite-horizon inverse control problem, whose goal is to infer, from observations, the possibly non-convex and non-stationary cost driving the actions of an agent. In this context, we present a result that enables cost estimation by solving an optimization problem that is convex even when the agent cost is not and when the underlying dynamics are nonlinear, non-stationary and stochastic. To obtain this result, we also study a finite-horizon forward control problem that has randomized policies as decision variables. For this problem, we give an explicit expression for the optimal solution. Moreover, we turn our findings into algorithmic procedures and validate them both in silico and in experiments with real hardware. All the experiments confirm the effectiveness of our approach.

research
12/29/2020

Learning non-stationary Langevin dynamics from stochastic observations of latent trajectories

Many complex systems operating far from the equilibrium exhibit stochast...
research
06/26/2023

Beyond dynamic programming

In this paper, we present Score-life programming, a novel theoretical ap...
research
07/20/2013

Non-stationary Stochastic Optimization

We consider a non-stationary variant of a sequential stochastic optimiza...
research
03/18/2021

Data-driven Coarse-grained Modeling of Non-equilibrium Systems

Modeling a high-dimensional Hamiltonian system in reduced dimensions wit...
research
05/25/2022

Non-stationary Bandits with Knapsacks

In this paper, we study the problem of bandits with knapsacks (BwK) in a...
research
01/21/2023

Open-World Multi-Task Control Through Goal-Aware Representation Learning and Adaptive Horizon Prediction

We study the problem of learning goal-conditioned policies in Minecraft,...
research
03/19/2021

On a probabilistic approach to synthesize control policies from example datasets

This paper is concerned with the design of control policies from example...
