Transferring Multiple Policies to Hotstart Reinforcement Learning in an Air Compressor Management Problem

01/30/2023
by Hélène Plisnier, et al.

Many instances of similar or almost-identical industrial machines or tools are often deployed at once, or in quick succession. For instance, a particular model of air compressor may be installed at hundreds of customer sites. Because these machines perform distinct but highly similar tasks, it is valuable to be able to quickly produce a high-quality controller for machine N+1 given the controllers already produced for machines 1..N. This is even more important when the controllers are learned through Reinforcement Learning, as training takes time, energy and other resources. In this paper, we apply Policy Intersection, a Policy Shaping method, to help a Reinforcement Learning agent learn to solve a new variant of a compressor control problem faster, by transferring knowledge from several previously learned controllers. We show that our approach outperforms loading an old controller, and significantly improves performance in the long run.
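The abstract does not spell out how Policy Intersection combines policies, so the following is only a minimal sketch under stated assumptions: actions are discrete, each previously learned controller (an "advisor") provides a probability distribution over actions for the current state, and the distributions are combined by an element-wise product followed by renormalization. The function name `intersect_policies`, the smoothing term `eps`, and the example numbers are all hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def intersect_policies(
    learner_probs: Sequence[float],
    advisor_probs: Sequence[Sequence[float]],
    eps: float = 1e-8,
) -> List[float]:
    """Combine a learner's action distribution with advisors' distributions.

    Assumption: the combination is an element-wise product, renormalized
    so that the result sums to 1. `eps` (hypothetical) keeps an advisor
    from permanently vetoing an action by assigning it probability zero.
    """
    combined = [float(p) for p in learner_probs]
    for advisor in advisor_probs:
        if len(advisor) != len(combined):
            raise ValueError("all policies must cover the same action set")
        combined = [p * (q + eps) for p, q in zip(combined, advisor)]

    total = sum(combined)
    if total <= 0.0:
        # Degenerate case (learner assigns zero mass everywhere):
        # fall back to a uniform distribution.
        return [1.0 / len(combined)] * len(combined)
    return [p / total for p in combined]


# Illustrative numbers only: a freshly initialized (uniform) learner
# and two advisors trained on earlier, similar machines.
learner = [0.25, 0.25, 0.25, 0.25]
advisors = [
    [0.70, 0.10, 0.10, 0.10],
    [0.60, 0.20, 0.10, 0.10],
]
mixed = intersect_policies(learner, advisors)
best_action = max(range(len(mixed)), key=lambda a: mixed[a])
```

In this sketch, a learner that is still uniform simply follows the advisors' consensus, while a learner that has become confident can outweigh them as its own distribution sharpens. That is one plausible reading of how several old controllers could help a new agent start quickly without capping its long-run performance.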


research
12/15/2018

Residual Policy Learning

We present Residual Policy Learning (RPL): a simple method for improving...
research
09/28/2018

Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped

Learning controllers for bipedal robots is a challenging problem, often ...
research
06/15/2022

Automating the resolution of flight conflicts: Deep reinforcement learning in service of air traffic controllers

Dense and complex air traffic scenarios require higher levels of automat...
research
02/06/2021

Multi-Agent Deep Reinforcement Learning for Request Dispatching in Distributed-Controller Software-Defined Networking

Recently, distributed controller architectures have been quickly gaining...
research
02/03/2020

Proportional integral derivative controller assisted reinforcement learning for path following by autonomous underwater vehicles

Control theory provides engineers with a multitude of tools to design co...
research
07/11/2019

DisCoRL: Continual Reinforcement Learning via Policy Distillation

In multi-task reinforcement learning there are two main challenges: at t...
research
01/10/2019

Motion Perception in Reinforcement Learning with Dynamic Objects

In dynamic environments, learned controllers are supposed to take motion...
