Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

05/20/2017
by   Daniel Hein, et al.
0

The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement learning (RL) benchmark that aims at being realistic by including a variety of aspects found in industrial applications, like continuous state and action spaces, a high dimensional, partially observable state space, delayed effects, and complex stochasticity. The experimental results of PSO-P on IB are compared to results of closed-form control policies derived from the model-based Recurrent Control Neural Network (RCNN) and the model-free Neural Fitted Q-Iteration (NFQ). Experiments show that PSO-P is not only of interest for academic benchmarks, but also for real-world industrial applications, since it also yielded the best performing policy in our IB setting. Compared to other well established RL techniques, PSO-P produced outstanding results in performance and robustness, requiring only a relatively low amount of effort in finding adequate parameters or making complex design decisions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/27/2017

A Benchmark Environment Motivated by Industrial Control Problems

In the research area of reinforcement learning (RL), frequently novel an...
research
08/12/2020

Overcoming Model Bias for Robust Offline Deep Reinforcement Learning

State-of-the-art reinforcement learning algorithms mostly rely on being ...
research
07/20/2020

Interpretable Control by Reinforcement Learning

In this paper, three recently introduced reinforcement learning (RL) met...
research
10/19/2016

Particle Swarm Optimization for Generating Interpretable Fuzzy Reinforcement Learning Policies

Fuzzy controllers are efficient and interpretable system controllers for...
research
12/12/2017

Interpretable Policies for Reinforcement Learning by Genetic Programming

The search for interpretable reinforcement learning policies is of high ...
research
08/11/2023

A Deep Recurrent-Reinforcement Learning Method for Intelligent AutoScaling of Serverless Functions

Function-as-a-Service (FaaS) introduces a lightweight, function-based cl...
research
11/29/2022

Approximating Martingale Process for Variance Reduction in Deep Reinforcement Learning with Large State Space

Approximating Martingale Process (AMP) is proven to be effective for var...

Please sign up or login with your details

Forgot password? Click here to reset