MILP-based Imitation Learning for HVAC control

by   Huy Truong Dinh, et al.

To optimize the operation of a HVAC system with advanced techniques such as artificial neural network, previous studies usually need forecast information in their method. However, the forecast information inevitably contains errors all the time, which degrade the performance of the HVAC operation. Hence, in this study, we propose MILP-based imitation learning method to control a HVAC system without using the forecast information in order to reduce energy cost and maintain thermal comfort at a given level. Our proposed controller is a deep neural network (DNN) trained by using data labeled by a MILP solver with historical data. After training, our controller is used to control the HVAC system with real-time data. For comparison, we also develop a second method named forecast-based MILP which control the HVAC system using the forecast information. The performance of the two methods is verified by using real outdoor temperatures and real day-ahead prices in Detroit city, Michigan, United States. Numerical results clearly show that the performance of the MILP-based imitation learning is better than that of the forecast-based MILP method in terms of hourly power consumption, daily energy cost, and thermal comfort. Moreover, the difference between results of the MILP-based imitation learning method and optimal results is almost negligible. These optimal results are achieved only by using the MILP solver at the end of a day when we have full information on the weather and prices for the day.


page 1

page 2

page 3

page 4


Robust Imitation Learning from Noisy Demonstrations

Learning from noisy demonstrations is a practical but highly challenging...

Cross-Domain Imitation Learning with a Dual Structure

In this paper, we consider cross-domain imitation learning (CDIL) in whi...

Motion Generation Using Bilateral Control-Based Imitation Learning with Autoregressive Learning

Robots that can execute various tasks automatically on behalf of humans ...

Digital Twins based Day-ahead Integrated Energy System Scheduling under Load and Renewable Energy Uncertainties

By constructing digital twins (DT) of an integrated energy system (IES),...

A New Autoregressive Neural Network Model with Command Compensation for Imitation Learning Based on Bilateral Control

In the near future, robots are expected to work with humans or operate a...

Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy

The energy output of photovoltaic (PV) power plants depends on the envir...

Training Electric Vehicle Charging Controllers with Imitation Learning

The problem of coordinating the charging of electric vehicles gains more...