Offline-Online Reinforcement Learning for Energy Pricing in Office Demand Response: Lowering Energy and Data Costs

08/14/2021
by   Doseok Jang, et al.
0

Our team is proposing to run a full-scale energy demand response experiment in an office building. Although this is an exciting endeavor which will provide value to the community, collecting training data for the reinforcement learning agent is costly and will be limited. In this work, we examine how offline training can be leveraged to minimize data costs (accelerate convergence) and program implementation costs. We present two approaches to doing so: pretraining our model to warm start the experiment with simulated tasks, and using a planning model trained to simulate the real world's rewards to the agent. We present results that demonstrate the utility of offline reinforcement learning to efficient price-setting in the energy demand response problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/29/2021

Using Meta Reinforcement Learning to Bridge the Gap between Simulation and Experiment in Energy Demand Response

Our team is proposing to run a full-scale energy demand response experim...
research
11/11/2021

Adapting Surprise Minimizing Reinforcement Learning Techniques for Transactive Control

Optimizing prices for energy demand response requires a flexible control...
research
06/10/2021

Data-driven battery operation for energy arbitrage using rainbow deep reinforcement learning

As the world seeks to become more sustainable, intelligent solutions are...
research
04/04/2023

Scalable Online Learning of Approximate Stackelberg Solutions in Energy Trading Games with Demand Response Aggregators

In this work, a Stackelberg game theoretic framework is proposed for tra...
research
10/24/2022

Energy Pricing in P2P Energy Systems Using Reinforcement Learning

The increase in renewable energy on the consumer side gives place to new...
research
05/19/2022

Data Valuation for Offline Reinforcement Learning

The success of deep reinforcement learning (DRL) hinges on the availabil...
research
11/30/2017

Demand Response in the Smart Grid: the Impact of Consumers Temporal Preferences

In Demand Response programs, price incentives might not be sufficient to...

Please sign up or login with your details

Forgot password? Click here to reset