Koopman Spectrum Nonlinear Regulator and Provably Efficient Online Learning

06/30/2021
by   Motoya Ohnishi, et al.
0

Most modern reinforcement learning algorithms optimize a cumulative single-step cost along a trajectory. The optimized motions are often 'unnatural', representing, for example, behaviors with sudden accelerations that waste energy and lack predictability. In this work, we present a novel paradigm of controlling nonlinear systems via the minimization of the Koopman spectrum cost: a cost over the Koopman operator of the controlled dynamics. This induces a broader class of dynamical behaviors that evolve over stable manifolds such as nonlinear oscillators, closed loops, and smooth movements. We demonstrate that some dynamics realizations that are not possible with a cumulative cost are feasible in this paradigm. Moreover, we present a provably efficient online learning algorithm for our problem that enjoys a sub-linear regret bound under some structural assumptions.

READ FULL TEXT

page 21

page 25

research
06/19/2018

Online Linear Quadratic Control

We study the problem of controlling linear time-invariant systems with k...
research
03/25/2020

Logarithmic Regret Bound in Partially Observable Linear Dynamical Systems

We study the problem of adaptive control in partially observable linear ...
research
06/09/2020

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

Prisoner's Dilemma mainly treat the choice to cooperate or defect as an ...
research
02/07/2021

Non-stationary Online Learning with Memory and Non-stochastic Control

We study the problem of Online Convex Optimization (OCO) with memory, wh...
research
05/28/2022

History-Restricted Online Learning

We introduce the concept of history-restricted no-regret online learning...
research
01/21/2020

TopRank+: A Refinement of TopRank Algorithm

Online learning to rank is a core problem in machine learning. In Lattim...
research
07/03/2017

Generalization Properties of Doubly Online Learning Algorithms

Doubly online learning algorithms are scalable kernel methods that perfo...

Please sign up or login with your details

Forgot password? Click here to reset