Combined Peak Reduction and Self-Consumption Using Proximal Policy Optimization

11/27/2022
by   Thijs Peirelinck, et al.
0

Residential demand response programs aim to activate demand flexibility at the household level. In recent years, reinforcement learning (RL) has gained significant attention for these type of applications. A major challenge of RL algorithms is data efficiency. New RL algorithms, such as proximal policy optimisation (PPO), have tried to increase data efficiency. Additionally, combining RL with transfer learning has been proposed in an effort to mitigate this challenge. In this work, we further improve upon state-of-the-art transfer learning performance by incorporating demand response domain knowledge into the learning pipeline. We evaluate our approach on a demand response use case where peak shaving and self-consumption is incentivised by means of a capacity tariff. We show our adapted version of PPO, combined with transfer learning, reduces cost by 14.51 compared to traditional PPO.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2018

Gotta Learn Fast: A New Benchmark for Generalization in RL

In this report, we present a new reinforcement learning (RL) benchmark b...
research
09/16/2020

Transfer Learning in Deep Reinforcement Learning: A Survey

This paper surveys the field of transfer learning in the problem setting...
research
10/04/2019

Manufacturing Dispatching using Reinforcement and Transfer Learning

Efficient dispatching rule in manufacturing industry is key to ensure pr...
research
12/18/2020

CityLearn: Standardizing Research in Multi-Agent Reinforcement Learning for Demand Response and Urban Energy Management

Rapid urbanization, increasing integration of distributed renewable ener...
research
09/17/2020

Competitiveness of MAP-Elites against Proximal Policy Optimization on locomotion tasks in deterministic simulations

The increasing importance of robots and automation creates a demand for ...
research
06/24/2023

Towards Optimal Pricing of Demand Response – A Nonparametric Constrained Policy Optimization Approach

Demand response (DR) has been demonstrated to be an effective method for...
research
07/20/2021

Proximal Policy Optimization for Tracking Control Exploiting Future Reference Information

In recent years, reinforcement learning (RL) has gained increasing atten...

Please sign up or login with your details

Forgot password? Click here to reset