Robust Policy Search for Robot Navigation with Stochastic Meta-Policies

03/02/2020
by   Javier Garcia-Barcos, et al.

Bayesian optimization is an efficient nonlinear optimization method where the queries are carefully selected to gather information about the optimum location. Thus, in the context of policy search, it has been called active policy search. The main ingredients of Bayesian optimization for sample efficiency are the probabilistic surrogate model and the optimal decision heuristics. In this work, we exploit those to provide robustness to different issues for policy search algorithms. We combine several methods and show how their interaction works better than the sum of the parts. First, to deal with input noise and provide a safe and repeatable policy we use an improved version of unscented Bayesian optimization. Then, to deal with mismodeling errors and improve exploration we use stochastic meta-policies for query selection and an adaptive kernel. We compare the proposed algorithm with previous results in several optimization benchmarks and robot tasks, such as pushing objects with a robot arm, or path finding with a rover.
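To make the idea of a stochastic meta-policy for query selection concrete, here is a minimal sketch. It assumes a Boltzmann (softmax) distribution over acquisition values on a finite candidate set, which is one common way to randomize query selection and not necessarily the formulation used in the paper. The names `boltzmann_query`, `toy_acquisition`, and `temperature` are hypothetical, and the acquisition function is a toy stand-in instead of one computed from a surrogate model.

```python
import math
import random


def boltzmann_query(candidates, acquisition, temperature=1.0, rng=None):
    """Sample the next query with probability proportional to exp(acq / temperature).

    Assumption: a softmax over acquisition values on a finite candidate set.
    """
    if rng is None:
        rng = random.Random()
    scores = [acquisition(x) for x in candidates]
    top = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp((s - top) / temperature) for s in scores]
    return rng.choices(candidates, weights=weights, k=1)[0]


def toy_acquisition(x):
    # Hypothetical stand-in for an acquisition function such as expected improvement.
    return -(x - 0.3) ** 2


candidates = [i / 10 for i in range(11)]
rng = random.Random(0)

# Greedy selection always returns the maximizer of the acquisition function.
greedy = max(candidates, key=toy_acquisition)

# Stochastic selection spreads queries around the maximizer.
samples = [
    boltzmann_query(candidates, toy_acquisition, temperature=0.05, rng=rng)
    for _ in range(200)
]

# A very low temperature approaches the greedy choice.
near_greedy = [
    boltzmann_query(candidates, toy_acquisition, temperature=1e-6, rng=rng)
    for _ in range(20)
]
```

In this sketch the temperature controls the trade-off: high values give near-uniform exploration over the candidates, while values near zero approach the usual greedy maximization of the acquisition function.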

research
03/07/2016

Unscented Bayesian Optimization for Safe Robot Grasping

We address the robot grasp optimization problem of unknown objects consi...
research
11/18/2020

Cautious Bayesian Optimization for Efficient and Scalable Policy Search

Sample efficiency is one of the key factors when applying policy search ...
research
12/06/2016

Factored Contextual Policy Search with Bayesian Optimization

Scarce data is a major challenge to scaling robot learning to truly comp...
research
09/20/2017

Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search

One of the most interesting features of Bayesian optimization for direct...
research
11/13/2015

Active Contextual Entropy Search

Contextual policy search allows adapting robotic movement primitives to ...
research
06/22/2021

Local policy search with Bayesian optimization

Reinforcement learning (RL) aims to find an optimal policy by interactio...
research
05/15/2020

Excursion Search for Constrained Bayesian Optimization under a Limited Budget of Failures

When learning to ride a bike, a child falls down a number of times befor...
