One-shot, Offline and Production-Scalable PID Optimisation with Deep Reinforcement Learning

10/25/2022
by Zacharaya Shabka, et al.

Proportional-integral-derivative (PID) control underlies more than 97% of automated industrial processes. Controlling these processes effectively with respect to a specified set of performance goals requires finding an optimal set of PID parameters to govern the PID loop. Tuning these parameters is a long and exhaustive process. A method (patent pending) based on deep reinforcement learning is presented that learns a relationship between generic system properties (e.g. resonance frequency), a multi-objective performance goal and optimal PID parameter values. Performance is demonstrated on a real optical switching product from the foremost manufacturer of such devices globally. Switching is handled by piezoelectric actuators, where switching time and optical loss are derived from the speed and stability of the actuator-control processes, respectively. The method achieves a 5× improvement in the number of actuators that fall within the most challenging target switching speed, a ≥ 20% improvement in mean switching speed at the same optical loss, and a ≥ 75% reduction in performance inconsistency when temperature varies between 5 and 73 degrees Celsius. Furthermore, once trained (which takes 𝒪(hours)), the model generates actuator-unique PID parameters in a one-shot inference process that takes 𝒪(ms), compared with up to 𝒪(week) for conventional tuning methods, thereby delivering these performance improvements with up to a 10^6× speed-up. After training, the method can be applied entirely offline, incurring effectively zero optimisation overhead in production.
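To make concrete what a "set of PID parameters" controls, the following is a minimal sketch of a discrete PID loop driving a lightly damped second-order plant characterised by a resonance frequency. It is not the paper's method: the plant model, the gain values and the names `PIDGains`, `simulate_step_response` and `settling_time` are all illustrative assumptions. In the approach described above, the gains would instead come from a trained model in a single inference step.

```python
import math
from dataclasses import dataclass


@dataclass
class PIDGains:
    """Hypothetical container for the three PID parameters."""
    kp: float  # proportional gain
    ki: float  # integral gain
    kd: float  # derivative gain


def simulate_step_response(gains, natural_freq_hz=100.0, damping_ratio=0.05,
                           dt=1e-5, duration=0.2, setpoint=1.0):
    """Simulate a PID loop on an assumed second-order plant.

    Plant (illustrative): x'' + 2*zeta*w*x' + w^2*x = w^2*u,
    where w is the resonance (natural) frequency in rad/s.
    Returns the position at every time step.
    """
    omega = 2.0 * math.pi * natural_freq_hz
    position = 0.0
    velocity = 0.0
    integral = 0.0
    prev_error = setpoint - position  # avoids a derivative spike on step 1
    trace = []
    for _ in range(int(duration / dt)):
        error = setpoint - position
        integral += error * dt
        derivative = (error - prev_error) / dt
        prev_error = error
        # PID control law: weighted sum of the three error terms.
        u = gains.kp * error + gains.ki * integral + gains.kd * derivative
        accel = omega ** 2 * (u - position) - 2.0 * damping_ratio * omega * velocity
        # Semi-implicit Euler: update velocity first, then position.
        velocity += accel * dt
        position += velocity * dt
        trace.append(position)
    return trace


def settling_time(trace, dt, setpoint=1.0, tolerance=0.02):
    """Time after which the trace stays within the tolerance band.

    Returns None if the response is still outside the band at the end.
    """
    last_outside = -1
    for i, x in enumerate(trace):
        if abs(x - setpoint) > tolerance * abs(setpoint):
            last_outside = i
    if last_outside == len(trace) - 1:
        return None
    return (last_outside + 1) * dt


# Hand-picked gains, assumed for illustration only.
gains = PIDGains(kp=1.0, ki=200.0, kd=0.002)
trace = simulate_step_response(gains)
print("settling time (s):", settling_time(trace, dt=1e-5))
```

In this toy setting the settling time plays the role of switching speed. Changing `kp`, `ki` or `kd` changes how quickly, and how stably, the plant reaches the setpoint, which is the trade-off that tuning has to resolve for each actuator.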

