March 3, 2026Open Access

Graph-based Safe Reinforcement Learning for Dynamic Optimal Power Flow with Hybrid Action Space Considering Time-varying Network Topologies

Key Points

Efficient navigation of discrete-continuous decision space enables optimized power flow management.
Numerical results indicate the proposed approach improves energy scheduling under dynamic network topologies.
Utilization of graph convolution operators helps adapt to real-time changes in network state.
A parameterized action constrained Markov decision process ensures compliance with physical network constraints.

Abstract

The proliferation of distributed energy resources and time-varying network topologies in active distribution networks presents unprecedented challenges for network operators. While reinforcement learning (RL) has shown promise in addressing network-constrained energy scheduling, it faces difficulties in managing the complexities of dynamic topologies and discrete-continuous hybrid action spaces. To address these challenges, a graph-based safe RL approach is proposed to learn dynamic optimal power flow under time-varying network topologies. This proposed approach leverages graph convolution operators to handle network topology changes, while safe RL with parameterized action ensures policy development. Specifically, the graph convolution operator abstracts key characteristics of the network topology, enabling effective power flow management in non-stationary environments. Besides that, a parameterized action constrained Markov decision process is employed to handle the hybrid action space and ensure compliance with physical network constraints, thereby accelerating the deployment of safe policy for hybrid action spaces. Numerical results demonstrate that the proposed approach efficiently navigates the discrete-continuous decision space while accounting for the constraints imposed by the dynamic nature of power flow in time-varying network topologies.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Zhang Xihai

Ge Shaoyun

Yue Zhou

Journals

Journal of Modern Power Systems and Clean Energy

Actions

Institutions

Cardiff University

Tianjin University

Zhengzhou University

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Graph-based Safe Reinforcement Learning for Dynamic Optimal Power Flow with Hybrid Action Space Considering Time-varying Network Topologies

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study