GenCos' Behaviors Modeling Based on Q Learning Improved by Dichotomy

08/04/2020
by   Qiangang Jia, et al.
0

Q learning is widely used to simulate the behaviors of generation companies (GenCos) in an electricity market. However, existing Q learning method usually requires numerous iterations to converge, which is time-consuming and inefficient in practice. To enhance the calculation efficiency, a novel Q learning algorithm improved by dichotomy is proposed in this paper. This method modifies the update process of the Q table by dichotomizing the state space and the action space step by step. Simulation results in a repeated Cournot game show the effectiveness of the proposed algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/04/2023

How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 1: A Paradigmatic Theory

In face of the pressing need of decarbonization in the power sector, the...
research
08/04/2019

Monte-Carlo Tree Search for Simulation-based Strategy Analysis

Games are often designed to shape player behavior in a desired way; howe...
research
06/29/2021

Efficient State-space Exploration in Massively Parallel Simulation Based Inference

Simulation-based Inference (SBI) is a widely used set of algorithms to l...
research
05/04/2023

How to Use Reinforcement Learning to Facilitate Future Electricity Market Design? Part 2: Method and Applications

This two-part paper develops a paradigmatic theory and detailed methods ...
research
03/09/2018

SpCoSLAM 2.0: An Improved and Scalable Online Learning of Spatial Concepts and Language Models with Mapping

In this paper, we propose a novel online learning algorithm, SpCoSLAM 2....
research
12/01/2019

Fast Stochastic Ordinal Embedding with Variance Reduction and Adaptive Step Size

Learning representation from relative similarity comparisons, often call...

Please sign up or login with your details

Forgot password? Click here to reset