CLIC: Curriculum Learning and Imitation for feature Control in non-rewarding environments

01/28/2019
by   Pierre Fournier, et al.
0

In this paper, we propose an unsupervised reinforcement learning agent called CLIC for Curriculum Learning and Imitation for Control. This agent learns to control features in its environment without external rewards, and observes the actions of a third party agent, Bob, who does not necessarily provide explicit guidance. CLIC selects which feature to train on and what to imitate from Bob's behavior by maximizing its learning progress. We show that CLIC can effectively identify helpful behaviors in Bob's actions, and imitate them to control the environment faster. CLIC can also follow Bob when he acts as a mentor and provides ordered demonstrations. Finally, when Bob controls features than the agent cannot, or in presence of a hierarchy between aspects of the environment, we show that CLIC ignores non-reproducible and already mastered behaviors, resulting in a greater benefit from imitation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/22/2020

PICO: Primitive Imitation for COntrol

In this work, we explore a novel framework for control of complex system...
research
02/24/2021

PsiPhi-Learning: Reinforcement Learning with Demonstrations using Successor Features and Inverse Temporal Difference Learning

We study reinforcement learning (RL) with no-reward demonstrations, a se...
research
08/21/2021

MimicBot: Combining Imitation and Reinforcement Learning to win in Bot Bowl

This paper describe an hybrid agent trained to play in Fantasy Football ...
research
10/02/2019

Unsupervised Doodling and Painting with Improved SPIRAL

We investigate using reinforcement learning agents as generative models ...
research
06/05/2018

Mix&Match - Agent Curricula for Reinforcement Learning

We introduce Mix&Match (M&M) - a training framework designed to facilita...
research
06/13/2019

Curriculum Learning for Cumulative Return Maximization

Curriculum learning has been successfully used in reinforcement learning...
research
06/03/2011

Accelerating Reinforcement Learning through Implicit Imitation

Imitation can be viewed as a means of enhancing learning in multiagent e...

Please sign up or login with your details

Forgot password? Click here to reset