Deep vs. Deep Bayesian: Reinforcement Learning on a Multi-Robot Competitive Experiment

07/21/2020
by   Jingyi Huang, et al.
0

Deep Reinforcement Learning (RL) experiments are commonly performed in simulated environment, due to the tremendous training sample demand from deep neural networks. However, model-based Deep Bayesian RL, such as Deep PILCO, allows a robot to learn good policies within few trials in the real world. Although Deep PILCO has been applied on many single-robot tasks, in here we propose, for the first time, an application of Deep PILCO on a multi-robot confrontation game, and compare the algorithm with a model-free Deep RL algorithm, Deep Q-Learning. Our experiments show that Deep PILCO significantly outperforms Deep Q-Learning in learning efficiency and scalability. We conclude that sample-efficient Deep Bayesian learning algorithms have great prospects on competitive games where the agent aims to win the opponents in the real world, as opposed to simulated applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

page 5

page 6

research
03/05/2021

MAMBPO: Sample-efficient multi-robot reinforcement learning using learned world models

Multi-robot systems can benefit from reinforcement learning (RL) algorit...
research
06/26/2019

Regularized Hierarchical Policies for Compositional Transfer in Robotics

The successful application of flexible, general learning algorithms -- s...
research
08/16/2022

A Walk in the Park: Learning to Walk in 20 Minutes With Model-Free Reinforcement Learning

Deep reinforcement learning is a promising approach to learning policies...
research
03/01/2019

Model-Based Reinforcement Learning for Atari

Model-free reinforcement learning (RL) can be used to learn effective po...
research
06/02/2018

DAQN: Deep Auto-encoder and Q-Network

The deep reinforcement learning method usually requires a large number o...
research
03/02/2022

Improving the Diversity of Bootstrapped DQN via Noisy Priors

Q-learning is one of the most well-known Reinforcement Learning algorith...
research
08/06/2020

Deep Reinforcement Learning for Tactile Robotics: Learning to Type on a Braille Keyboard

Artificial touch would seem well-suited for Reinforcement Learning (RL),...

Please sign up or login with your details

Forgot password? Click here to reset