Near-optimal Deep Reinforcement Learning Policies from Data for Zone Temperature Control

03/10/2022
by   Loris Di Natale, et al.

Replacing poorly performing existing controllers with smarter solutions will decrease the energy intensity of the building sector. Recently, controllers based on Deep Reinforcement Learning (DRL) have been shown to be more effective than conventional baselines. However, since the optimal solution is usually unknown, it remains unclear whether DRL agents attain near-optimal performance in general or whether there is still a large gap to bridge. In this paper, we investigate the performance of DRL agents compared to the theoretically optimal solution. To that end, we leverage Physically Consistent Neural Networks (PCNNs) as simulation environments, for which optimal control inputs are easy to compute. Furthermore, PCNNs are trained solely from data, avoiding the difficult physics-based modeling phase while retaining physical consistency. Our results hint that DRL agents not only clearly outperform conventional rule-based controllers but also attain near-optimal performance.

research
12/17/2020

Towards Optimal District Heating Temperature Control in China with Deep Reinforcement Learning

Achieving efficiency gains in Chinese district heating networks, thereby...
research
08/25/2023

Pretty darn good control: when are approximate solutions better than approximate models

Existing methods for optimal control struggle to deal with the complexit...
research
07/31/2023

Distributionally Robust Safety Filter for Learning-Based Control in Active Distribution Systems

Operational constraint violations may occur when deep reinforcement lear...
research
05/06/2022

Vehicle management in a modular production context using Deep Q-Learning

We investigate the feasibility of deploying Deep-Q based deep reinforcem...
research
02/24/2023

Prioritized Trace Selection: Towards High-Performance DRL-based Network Controllers

Deep Reinforcement Learning (DRL) based controllers offer high performan...
research
01/23/2021

Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Dynamic platforms that operate over many unique terrain conditions typica...
research
10/02/2020

Modeling all alternative solutions for highly renewable energy systems

As the world is transitioning towards highly renewable energy systems, a...
