An Active Learning Framework for Efficient Robust Policy Search

01/01/2019
by Sai Kiran Narayanaswami, et al.

Robust Policy Search is the problem of learning policies whose performance does not degrade when they are subjected to unseen environment model parameters. It is particularly relevant for transferring policies learned in a simulation environment to the real world. Several existing approaches sample large batches of trajectories that reflect the differences among the possible environments, and then select some subset of these to learn robust policies, such as the trajectories that result in the worst performance. We propose an active-learning-based framework, EffAcTS, that selectively chooses model parameters for this purpose, so that only as much data is collected as is necessary to select such a subset. We apply this framework to an existing method, namely EPOpt, and experimentally validate the gains in sample efficiency and the performance of our approach on standard continuous control tasks. We also present a Multi-Task Learning perspective on the problem of Robust Policy Search, and draw connections from our proposed framework to existing work on Multi-Task Learning.
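As a rough illustration of the batch-and-select step that the abstract attributes to existing approaches, the Python sketch below samples environment model parameters, evaluates a return for each, and keeps the worst-performing fraction. It is not the EffAcTS or EPOpt implementation: the rollout stand-in `toy_return`, the helper `worst_subset`, the parameter range, and the 10% fraction are all hypothetical choices made for this example.

```python
import random


def toy_return(param, rng):
    """Hypothetical stand-in for a policy rollout.

    The return is highest when the model parameter equals 1.0 and
    degrades as the parameter moves away from it, plus small noise.
    """
    return -abs(param - 1.0) + rng.gauss(0.0, 0.01)


def worst_subset(returns, fraction):
    """Return indices of the lowest-return entries (at least one)."""
    k = max(1, int(len(returns) * fraction))
    order = sorted(range(len(returns)), key=lambda i: returns[i])
    return order[:k]


rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Sample a batch of model parameters (assumed 1-D, uniform range).
params = [rng.uniform(0.5, 1.5) for _ in range(100)]

# Evaluate one "trajectory" per sampled parameter.
returns = [toy_return(p, rng) for p in params]

# Keep the worst 10% of trajectories; a robust policy update
# would then be computed from this subset only.
worst = worst_subset(returns, 0.1)
print(len(worst), "of", len(returns), "trajectories selected")
```

In this naive version all 100 parameters must be evaluated before the worst ones can be identified. The abstract's stated aim is to reduce that cost by choosing which model parameters to evaluate, so that fewer trajectories are collected to find such a subset.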

research
11/21/2022

PartAL: Efficient Partial Active Learning in Multi-Task Visual Settings

Multi-task learning is central to many real-world applications. Unfortun...
research
02/20/2017

Learning to Multi-Task by Active Sampling

One of the long-standing challenges in Artificial Intelligence for learn...
research
06/05/2018

Multi-Task Active Learning for Neural Semantic Role Labeling on Low Resource Conversational Corpus

Most Semantic Role Labeling (SRL) approaches are supervised methods whic...
research
09/06/2022

Cross apprenticeship learning framework: Properties and solution approaches

Apprenticeship learning is a framework in which an agent learns a policy...
research
02/07/2020

Ready Policy One: World Building Through Active Learning

Model-Based Reinforcement Learning (MBRL) offers a promising direction f...
research
02/02/2022

Active Multi-Task Representation Learning

To leverage the power of big data from source tasks and overcome the sca...
research
04/28/2023

Learning adaptive manipulation of objects with revolute joint: A case study on varied cabinet doors opening

This paper introduces a learning-based framework for robot adaptive mani...
