Gaussian Process Policy Optimization

03/02/2020
by   Ashish Rao, et al.

We propose a novel actor-critic, model-free reinforcement learning algorithm that employs a Bayesian method of parameter space exploration to solve environments. A Gaussian process is used to learn the expected return of a policy given the policy's parameters. The system is trained by updating the parameters using gradient descent on a new surrogate loss function consisting of the Proximal Policy Optimization 'Clipped' loss function and a bonus term representing the expected improvement acquisition function given by the Gaussian process. This new method is shown to be comparable to, and at times to empirically outperform, current algorithms on environments that simulate robotic locomotion using the MuJoCo physics engine.
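
As a rough illustration of how such a surrogate loss might be assembled, the sketch below combines a PPO clipped objective with an expected-improvement bonus computed from a Gaussian posterior over returns. It is not the authors' implementation. The function names, the additive form of the combination, and the `bonus_weight` parameter are assumptions made for illustration, and the posterior mean `mu` and standard deviation `sigma` are passed in as plain numbers rather than produced by a fitted Gaussian process.

```python
import math


def expected_improvement(mu, sigma, best_return):
    """Expected improvement (maximization form) under a Gaussian posterior N(mu, sigma^2)."""
    if sigma <= 0.0:
        # Degenerate posterior: the improvement is deterministic.
        return max(mu - best_return, 0.0)
    z = (mu - best_return) / sigma
    pdf = math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)
    cdf = 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))
    return (mu - best_return) * cdf + sigma * pdf


def ppo_clip_objective(ratios, advantages, eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to be maximized)."""
    total = 0.0
    for r, a in zip(ratios, advantages):
        clipped = min(max(r, 1.0 - eps), 1.0 + eps)
        total += min(r * a, clipped * a)
    return total / len(ratios)


def combined_loss(ratios, advantages, mu, sigma, best_return,
                  eps=0.2, bonus_weight=0.1):
    """Loss to minimize: negated clipped objective minus a weighted EI bonus.

    The additive combination and bonus_weight are assumptions of this sketch.
    """
    objective = ppo_clip_objective(ratios, advantages, eps)
    bonus = expected_improvement(mu, sigma, best_return)
    return -objective - bonus_weight * bonus
```

This version only evaluates the loss numerically. A trainable version would need `mu` and `sigma` to be differentiable functions of the policy parameters so that the bonus term contributes to the gradient.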

