A Learning-based Optimal Market Bidding Strategy for Price-Maker Energy Storage

06/04/2021
by   Mathilde D. Badoual, et al.
0

Load serving entities with storage units reach sizes and performances that can significantly impact clearing prices in electricity markets. Nevertheless, price endogeneity is rarely considered in storage bidding strategies and modeling the electricity market is a challenging task. Meanwhile, model-free reinforcement learning such as the Actor-Critic are becoming increasingly popular for designing energy system controllers. Yet implementation frequently requires lengthy, data-intense, and unsafe trial-and-error training. To fill these gaps, we implement an online Supervised Actor-Critic (SAC) algorithm, supervised with a model-based controller – Model Predictive Control (MPC). The energy storage agent is trained with this algorithm to optimally bid while learning and adjusting to its impact on the market clearing prices. We compare the supervised Actor-Critic algorithm with the MPC algorithm as a supervisor, finding that the former reaps higher profits via learning. Our contribution, thus, is an online and safe SAC algorithm that outperforms the current model-based state-of-the-art.

READ FULL TEXT
research
04/29/2020

How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization

Deterministic-policy actor-critic algorithms for continuous control impr...
research
12/09/2020

Deep Reinforcement Learning for Long Term Hydropower Production Scheduling

We explore the use of deep reinforcement learning to provide strategies ...
research
01/02/2023

Transferable Energy Storage Bidder

Energy storage resources must consider both price uncertainties and thei...
research
04/18/2022

On Parametric Optimal Execution and Machine Learning Surrogates

We investigate optimal execution problems with instantaneous price impac...
research
06/16/2023

Actor-Critic Model Predictive Control

Despite its success, Model Predictive Control (MPC) often requires inten...
research
04/04/2020

Model-based actor-critic: GAN + DRL (actor-critic) => AGI

Our effort is toward unifying GAN and DRL algorithms into a unifying AI ...
research
02/28/2019

Infer Your Enemies and Know Yourself, Learning in Real-Time Bidding with Partially Observable Opponents

Real-time bidding, as one of the most popular mechanisms for selling onl...

Please sign up or login with your details

Forgot password? Click here to reset