Learning from humans: combining imitation and deep reinforcement learning to accomplish human-level performance on a virtual foraging task

03/11/2022
by Vittorio Giammarino, et al.

We develop a method to learn bio-inspired foraging policies from human data. We conduct an experiment in which humans are virtually immersed in an open-field foraging environment and are trained to collect as much reward as possible. A Markov Decision Process (MDP) framework is introduced to model the human decision dynamics. Imitation Learning (IL) based on maximum likelihood estimation is then used to train Neural Networks (NN) that map observed states to human decisions. The results show that passive imitation substantially underperforms humans. We therefore refine the human-inspired policies via Reinforcement Learning (RL), using on-policy algorithms, which are better suited to learning from pre-trained networks. We show that the combination of IL and RL can match human results and that good performance depends strongly on an egocentric representation of the environment. The developed methodology can be used to efficiently learn policies for unmanned vehicles that must carry out missions in an open-field environment.
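
To make the imitation step concrete, the following is a minimal sketch of maximum likelihood imitation (behavioral cloning) over state-action pairs. It is not the authors' implementation: it assumes a linear softmax policy in place of a neural network, and it uses synthetic demonstrations from a made-up deterministic demonstrator instead of human data. The names softmax, nll, fit_bc, true_W, and all sizes are hypothetical and chosen only for illustration.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the action dimension.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def nll(W, states, actions):
    # Average negative log-likelihood of the demonstrated actions
    # under the policy pi(a | s) = softmax(s @ W).
    probs = softmax(states @ W)
    picked = probs[np.arange(len(actions)), actions]
    return -np.mean(np.log(picked + 1e-12))

def fit_bc(states, actions, n_actions, lr=0.5, steps=500):
    # Maximum likelihood estimation by plain gradient descent on the NLL.
    W = np.zeros((states.shape[1], n_actions))
    onehot = np.eye(n_actions)[actions]
    for _ in range(steps):
        probs = softmax(states @ W)
        grad = states.T @ (probs - onehot) / len(actions)
        W -= lr * grad
    return W

# Synthetic stand-in for demonstrations (assumption: NOT human data).
rng = np.random.default_rng(0)
states = rng.normal(size=(400, 3))            # hypothetical 3-d state features
true_W = rng.normal(size=(3, 4))              # hypothetical demonstrator
actions = np.argmax(states @ true_W, axis=1)  # 4 discrete actions

W = fit_bc(states, actions, n_actions=4)
accuracy = np.mean(np.argmax(states @ W, axis=1) == actions)
print("NLL before:", nll(np.zeros((3, 4)), states, actions))
print("NLL after: ", nll(W, states, actions))
print("train accuracy:", accuracy)
```

In the pipeline the abstract describes, a policy fitted this way would serve only as the starting point for on-policy RL refinement; that refinement step is not shown here.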


