Log In Sign Up

Multi-Stage Transmission Line Flow Control Using Centralized and Decentralized Reinforcement Learning Agents

by   Xiumin Shang, et al.

Planning future operational scenarios of bulk power systems that meet security and economic constraints typically requires intensive labor efforts in performing massive simulations. To automate this process and relieve engineers' burden, a novel multi-stage control approach is presented in this paper to train centralized and decentralized reinforcement learning agents that can automatically adjust grid controllers for regulating transmission line flows at normal condition and under contingencies. The power grid flow control problem is formulated as Markov Decision Process (MDP). At stage one, centralized soft actor-critic (SAC) agent is trained to control generator active power outputs in a wide area to control transmission line flows against specified security limits. If line overloading issues remain unresolved, stage two is used to train decentralized SAC agent via load throw-over at local substations. The effectiveness of the proposed approach is verified on a series of actual planning cases used for operating the power grid of SGCC Zhejiang Electric Power Company.


page 1

page 2

page 3

page 4


Online Multi-agent Reinforcement Learning for Decentralized Inverter-based Volt-VAR Control

The distributed Volt/Var control (VVC) methods have been widely studied ...

Reinforcement Learning based Proactive Control for Transmission Grid Resilience to Wildfire

Power grid operation subject to an extreme event requires decision-makin...

Contrasting Centralized and Decentralized Critics in Multi-Agent Reinforcement Learning

Centralized Training for Decentralized Execution, where agents are train...

Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer

The increased integration of renewable energy poses a slew of technical ...

Lifelong Control of Off-grid Microgrid with Model Based Reinforcement Learning

The lifelong control problem of an off-grid microgrid is composed of two...

Data-Driven Decentralized Optimal Power Flow

The implementation of optimal power flow (OPF) methods to perform voltag...

Fast Power system security analysis with Guided Dropout

We propose a new method to efficiently compute load-flows (the steady-st...