Balancing a CartPole System with Reinforcement Learning – A Tutorial

06/08/2020
by   Swagat Kumar, et al.
0

In this paper, we provide the details of implementing various reinforcement learning (RL) algorithms for controlling a Cart-Pole system. In particular, we describe various RL concepts such as Q-learning, Deep Q Networks (DQN), Double DQN, Dueling networks, (prioritized) experience replay and show their effect on the learning performance. In the process, the readers will be introduced to OpenAI/Gym and Keras utilities used for implementing the above concepts. It is observed that DQN with PER provides best performance among all other architectures being able to solve the problem within 150 episodes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2018

An Introduction to Deep Reinforcement Learning

Deep reinforcement learning is the combination of reinforcement learning...
research
02/09/2021

Reverb: A Framework For Experience Replay

A central component of training in Reinforcement Learning (RL) is Experi...
research
06/09/2017

Symmetry Learning for Function Approximation in Reinforcement Learning

In this paper we explore methods to exploit symmetries for ensuring samp...
research
04/11/2022

Implementing Online Reinforcement Learning with Temporal Neural Networks

A Temporal Neural Network (TNN) architecture for implementing efficient ...
research
02/02/2020

SARSA(0) Reinforcement Learning over Fully Homomorphic Encryption

We consider a cloud-based control architecture in which the local plants...
research
01/14/2022

Reinforcement Learning based Air Combat Maneuver Generation

The advent of artificial intelligence technology paved the way of many r...
research
08/23/2021

Collect Infer – a fresh look at data-efficient Reinforcement Learning

This position paper proposes a fresh look at Reinforcement Learning (RL)...

Please sign up or login with your details

Forgot password? Click here to reset