Playing Atari Ball Games with Hierarchical Reinforcement Learning

09/27/2019
by   Hua Huang, et al.
0

Human beings are particularly good at reasoning and inference from just a few examples. When facing new tasks, humans will leverage knowledge and skills learned before, and quickly integrate them with the new task. In addition to learning by experimentation, human also learn socio-culturally through instructions and learning by example. In this way humans can learn much faster compared with most current artificial intelligence algorithms in many tasks. In this paper, we test the idea of speeding up machine learning through social learning. We argue that in solving real-world problems, especially when the task is designed by humans, and/or for humans, there are typically instructions from user manuals and/or human experts which give guidelines on how to better accomplish the tasks. We argue that these instructions have tremendous value in designing a reinforcement learning system which can learn in human fashion, and we test the idea by playing the Atari games Tennis and Pong. We experimentally demonstrate that the instructions provide key information about the task, which can be used to decompose the learning task into sub-systems and construct options for the temporally extended planning, and dramatically accelerate the learning process.

READ FULL TEXT

page 6

page 7

research
09/11/2022

Meta-Reinforcement Learning via Language Instructions

Although deep reinforcement learning has recently been very successful a...
research
09/10/2018

Keep it stupid simple

Deep reinforcement learning can match and exceed human performance, but ...
research
11/28/2018

Trajectory-based Learning for Ball-in-Maze Games

Deep Reinforcement Learning has shown tremendous success in solving seve...
research
06/16/2022

How to talk so your robot will learn: Instructions, descriptions, and pragmatics

From the earliest years of our lives, humans use language to express our...
research
11/07/2016

Playing SNES in the Retro Learning Environment

Mastering a video game requires skill, tactics and strategy. While these...
research
09/17/2018

The Fast and the Flexible: training neural networks to learn to follow instructions from small data

Learning to follow human instructions is a challenging task because whil...
research
05/09/2018

Learning Coordinated Tasks using Reinforcement Learning in Humanoids

With the advent of artificial intelligence and machine learning, humanoi...

Please sign up or login with your details

Forgot password? Click here to reset