Stronger Generalization Guarantees for Robot Learning by Combining Generative Models and Real-World Data

11/16/2021
by   Abhinav Agarwal, et al.
7

We are motivated by the problem of learning policies for robotic systems with rich sensory inputs (e.g., vision) in a manner that allows us to guarantee generalization to environments unseen during training. We provide a framework for providing such generalization guarantees by leveraging a finite dataset of real-world environments in combination with a (potentially inaccurate) generative model of environments. The key idea behind our approach is to utilize the generative model in order to implicitly specify a prior over policies. This prior is updated using the real-world dataset of environments by minimizing an upper bound on the expected cost across novel environments derived via Probably Approximately Correct (PAC)-Bayes generalization theory. We demonstrate our approach on two simulated systems with nonlinear/hybrid dynamics and rich sensing modalities: (i) quadrotor navigation with an onboard vision sensor, and (ii) grasping objects using a depth sensor. Comparisons with prior work demonstrate the ability of our approach to obtain stronger generalization guarantees by utilizing generative models. We also present hardware experiments for validating our bounds for the grasping task.

READ FULL TEXT

page 1

page 5

page 6

research
02/28/2020

Probably Approximately Correct Vision-Based Planning using Motion Primitives

This paper presents a deep reinforcement learning approach for synthesiz...
research
06/11/2018

PAC-Bayes Control: Synthesizing Controllers that Provably Generalize to Novel Environments

Our goal is to synthesize controllers for robots that provably generaliz...
research
07/13/2021

Distributionally Robust Policy Learning via Adversarial Environment Generation

Our goal is to train control policies that generalize well to unseen env...
research
06/25/2021

Task-Driven Out-of-Distribution Detection with Statistical Guarantees for Robot Learning

Our goal is to perform out-of-distribution (OOD) detection, i.e., to det...
research
02/11/2022

Failure Prediction with Statistical Guarantees for Vision-Based Robot Control

We are motivated by the problem of performing failure prediction for saf...
research
01/20/2022

Sim-to-Lab-to-Real: Safe Reinforcement Learning with Shielding and Generalization Guarantees

Safety is a critical component of autonomous systems and remains a chall...
research
11/16/2021

Learning Provably Robust Motion Planners Using Funnel Libraries

This paper presents an approach for learning motion planners that are ac...

Please sign up or login with your details

Forgot password? Click here to reset