What question did this study set out to answer?

This research aims to improve active power scheduling for power systems by using a multi-agent reinforcement learning approach.

March 18, 2026Open Access

Enhancing Active Power Scheduling with Multimodal Grid Data Based on Multi-Agent Reinforcement Learning

Key Points

This research aims to improve active power scheduling for power systems by using a multi-agent reinforcement learning approach.
Developed the HMAT-PPO framework combining Transformers and Proximal Policy Optimization.
Integrated multimodal data sensing to enhance decision-making.
Evaluated the performance against baseline models in a controlled environment.
The HMAT-PPO framework significantly reduces generation costs compared to traditional methods.
Transmission losses are minimized effectively.
Operational constraints are successfully satisfied in various scenarios.

Abstract

The rapid and widespread integration of renewable energy sources introduces significant challenges for power system dispatch. Owing to the inherent intermittency and variability of renewable outputs, conventional active power scheduling methods based on static models are often inadequate for capturing system dynamics and managing operational uncertainties. To address these issues, this paper proposes an optimization approach that integrates multimodal data sensing with multi-agent deep reinforcement learning. The proposed framework, named Heterogeneous Multi-Agent Transformer with Proximal Policy Optimization (HMAT-PPO), combines a Transformer-based architecture with PPO to jointly capture the spatial structures, temporal dynamics, and operational features of power grids. Through the incorporation of multimodal alignment and gated fusion mechanisms, the proposed framework enables the integration of heterogeneous information sources—such as grid topology, load fluctuations, and generator states—which significantly augments the agents’ environmental awareness and promotes collaborative, context-aware decision-making. Experimental results demonstrate that the proposed method consistently outperforms baseline models in minimizing generation cost, reducing transmission losses, and satisfying operational constraints, thereby offering both theoretical significance and practical value.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Liudong Zhang

Wenlu Ji

Tianhai Zhang

Journals

Data Intelligence

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Enhancing Active Power Scheduling with Multimodal Grid Data Based on Multi-Agent Reinforcement Learning

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study