A Maintenance Planning Framework using Online and Offline Deep Reinforcement Learning

08/01/2022
by   Zaharah A. Bukhsh, et al.

Cost-effective asset management is an area of interest across several industries. This paper develops a deep reinforcement learning (DRL) solution that automatically determines an optimal rehabilitation policy for continuously deteriorating water pipes. We approach rehabilitation planning in both an online and an offline DRL setting. In online DRL, the agent interacts with a simulated environment of multiple pipes with distinct length, material, and failure rate characteristics. We train the agent using deep Q-learning (DQN) to learn an optimal policy with minimal average cost and reduced failure probability. In offline learning, the agent uses static data, e.g., DQN replay data, to learn an optimal policy via a conservative Q-learning algorithm without further interaction with the environment. We demonstrate that DRL-based policies improve over standard preventive, corrective, and greedy planning alternatives. Additionally, the policy learned offline from the fixed DQN replay dataset outperforms the policy learned in the online DQN setting. The results suggest that existing deterioration profiles of water pipes, which comprise large and diverse state and action trajectories, provide a valuable avenue for learning rehabilitation policies in the offline setting without needing a simulator.
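To make the offline objective concrete, the following is a minimal sketch of a discrete-action conservative Q-learning loss computed on one batch of replay transitions. It is not the authors' implementation: the function name `cql_dqn_loss`, the weight `alpha`, the discount `gamma`, the three-action layout, and the sample numbers are all assumptions chosen for illustration. The sketch adds a penalty (log-sum-exp of all Q-values minus the Q-value of the logged action) to the usual DQN temporal-difference error, which discourages high value estimates for actions that the static dataset does not contain.

```python
import math


def cql_dqn_loss(q_values, actions, rewards, next_q_values, dones,
                 gamma=0.99, alpha=1.0):
    """Sketch of a conservative Q-learning loss for discrete actions.

    q_values      -- list of per-state lists Q(s, .) from the trained network
    actions       -- integer actions logged in the static dataset
    rewards       -- logged rewards (e.g. negative rehabilitation cost)
    next_q_values -- list of per-state lists Q(s', .) from a target network
    dones         -- True where the episode ended at this transition
    Returns (total_loss, td_loss, conservative_penalty).
    """
    td_terms, gap_terms = [], []
    for q_s, a, r, q_next, done in zip(q_values, actions, rewards,
                                       next_q_values, dones):
        # Standard DQN bootstrap target; no bootstrap on terminal steps.
        target = r + gamma * (0.0 if done else max(q_next))
        td_terms.append((q_s[a] - target) ** 2)

        # Numerically stable log-sum-exp over all actions in this state.
        m = max(q_s)
        lse = m + math.log(sum(math.exp(q - m) for q in q_s))

        # Conservative gap: always >= 0, smallest when the logged
        # action already dominates the other actions.
        gap_terms.append(lse - q_s[a])

    td_loss = sum(td_terms) / len(td_terms)
    penalty = sum(gap_terms) / len(gap_terms)
    return td_loss + alpha * penalty, td_loss, penalty


# Hypothetical batch of two transitions; the action indices
# (0 = do nothing, 1 = maintain, 2 = replace) are an assumed encoding.
total, td, penalty = cql_dqn_loss(
    q_values=[[1.0, 2.0, 3.0], [0.5, 0.1, -0.2]],
    actions=[2, 0],
    rewards=[-1.0, -0.1],
    next_q_values=[[0.0, 0.0, 0.0], [0.4, 0.2, 0.1]],
    dones=[True, False],
)
print(f"total={total:.4f} td={td:.4f} penalty={penalty:.4f}")
```

Setting `alpha` to zero recovers the plain DQN error, so the same function can be used to compare online-style and conservative training signals on identical replay data.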


