Demand-Side Scheduling Based on Deep Actor-Critic Learning for Smart Grids

05/05/2020
by Joash Lee, et al.
We consider the problem of demand-side energy management, where each household is equipped with a smart meter that can schedule home appliances online. The goal is to minimise the overall cost under a real-time pricing scheme. While previous works have introduced centralised approaches, we formulate the smart grid environment as a Markov game, where each household is a decentralised agent, and the grid operator produces a price signal that adapts to the energy demand. The main challenge addressed in our approach is the partial observability and perceived non-stationarity of the environment from the viewpoint of each agent. We propose a multi-agent extension of a deep actor-critic algorithm that learns successfully in this environment. This algorithm learns a centralised critic that coordinates the training of all agents. Our approach thus uses centralised learning but decentralised execution. Simulation results show that our online deep reinforcement learning method can reduce both the peak-to-average ratio of total energy consumed and the cost of electricity for all households, based purely on instantaneous observations and a price signal.
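To make the two evaluation quantities named in the abstract concrete, the following is a minimal sketch of how the peak-to-average ratio (PAR) of total consumption and the per-household electricity cost could be computed for a given set of schedules. The abstract does not specify the pricing rule, so the linear price in `linear_price`, its `base` and `slope` parameters, and the function names are all assumptions made for illustration only, not the paper's formulation.

```python
def peak_to_average_ratio(total_load):
    """PAR = peak load divided by mean load over the scheduling horizon."""
    if not total_load:
        raise ValueError("total_load must be non-empty")
    mean = sum(total_load) / len(total_load)
    if mean == 0:
        raise ValueError("mean load is zero; PAR is undefined")
    return max(total_load) / mean


def linear_price(total_demand, base=0.10, slope=0.02):
    # Hypothetical price signal: rises linearly with aggregate demand.
    # The paper's actual pricing scheme is not given in the abstract.
    return base + slope * total_demand


def evaluate(schedules):
    """schedules: one list per household, energy used at each time step.

    Returns (total load per step, cost per household).
    """
    horizon = len(schedules[0])
    if any(len(h) != horizon for h in schedules):
        raise ValueError("all households must share the same horizon")
    # Aggregate demand seen by the grid operator at each time step.
    total = [sum(h[t] for h in schedules) for t in range(horizon)]
    # Each household pays the demand-dependent price for its own usage.
    costs = [
        sum(linear_price(total[t]) * h[t] for t in range(horizon))
        for h in schedules
    ]
    return total, costs


# Two households using the same energy, either all at once or spread out.
peaky = [[0, 4, 0, 0], [0, 4, 0, 0]]
flat = [[1, 1, 1, 1], [1, 1, 1, 1]]

peaky_total, peaky_costs = evaluate(peaky)
flat_total, flat_costs = evaluate(flat)
```

Under this assumed price, the flat schedules give a PAR of 1.0 against 4.0 for the peaky ones, and every household pays less, which is the kind of joint reduction the abstract reports.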

