Safety-guaranteed Reinforcement Learning based on Multi-class Support Vector Machine

06/12/2020
by   Kwangyeon Kim, et al.
15

Several works have addressed the problem of incorporating constraints in the reinforcement learning (RL) framework, however majority of them can only guarantee the satisfaction of soft constraints. In this work, we address the problem of satisfying hard state constraints in a model-free RL setting with the deterministic system dynamics. The proposed algorithm is developed for the discrete state and action space and utilizes a multi-class support vector machine (SVM) to represent the policy. The state constraints are incorporated in the SVM optimization framework to derive an analytical solution for determining the policy parameters. This final policy converges to a solution which is guaranteed to satisfy the constraints. Additionally, the proposed formulation adheres to the Q-learning framework and thus, also guarantees convergence to the optimal solution. The algorithm is demonstrated with multiple example problems.

READ FULL TEXT

page 5

page 7

research
11/30/2015

Proximal gradient method for huberized support vector machine

The Support Vector Machine (SVM) has been used in a wide variety of clas...
research
02/14/2022

SAUTE RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

Satisfying safety constraints almost surely (or with probability one) ca...
research
03/23/2021

Assured Learning-enabled Autonomy: A Metacognitive Reinforcement Learning Framework

Reinforcement learning (RL) agents with pre-specified reward functions c...
research
12/24/2020

Assured RL: Reinforcement Learning with Almost Sure Constraints

We consider the problem of finding optimal policies for a Markov Decisio...
research
03/26/2020

A Flexible Job Shop Scheduling Representation of the Autonomous In-Space Assembly Task Assignment Problem

As in-space exploration increases, autonomous systems will play a vital ...
research
06/24/2021

Density Constrained Reinforcement Learning

We study constrained reinforcement learning (CRL) from a novel perspecti...
research
05/11/2020

A Relational Gradient Descent Algorithm For Support Vector Machine Training

We consider gradient descent like algorithms for Support Vector Machine ...

Please sign up or login with your details

Forgot password? Click here to reset