What question did this study set out to answer?

The aim is to develop an optimal model for reservoir operations that incorporates inflow forecasts through deep reinforcement learning.

April 18, 2026 · Open Access

A Novel and Optimal Reservoir Operation Model Incorporating Inflow Forecasts Based on Deep Reinforcement Learning Algorithms

Key Points

Built a novel reservoir operation model with inflow forecasts based on DRL.
Utilized a multi-dimensional reward function derived from objective functions and constraints.
Evaluated model performance against actual operation results using historical daily flow data and inflow forecasts.
Dynamic weighting in Scheme-1 increased annual average flood prevention storage capacity by approximately 36.8%.
Enhanced power generation by about 2.86 billion kW·h, equating to a 5.49% increase.
Reduced spillway waste water volume by around 3.33 billion m³ compared to actual operations.
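The generation figures above imply a baseline that the key points do not state. As a rough check, assuming both the 2.86 billion kW·h gain and the 5.49% increase refer to the same period and the same actual-operation baseline, the implied totals can be recovered as follows. The variable names are illustrative, and the results carry the rounding of the quoted figures.

```python
# Figures quoted in the key points (both are rounded in the source).
gain_kwh = 2.86e9        # additional generation, kW·h
gain_fraction = 0.0549   # relative increase of 5.49%

# Implied generation under actual operation and under Scheme-1,
# assuming both figures describe the same period and baseline.
baseline_kwh = gain_kwh / gain_fraction
scheme1_kwh = baseline_kwh + gain_kwh

print(f"implied actual-operation generation: {baseline_kwh / 1e9:.1f} billion kW·h")
print(f"implied Scheme-1 generation: {scheme1_kwh / 1e9:.1f} billion kW·h")
```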

Abstract

Deep reinforcement learning (DRL) has been increasingly used in reservoir operation, but several key challenges and limitations need further study. This paper develops a novel and optimal reservoir operation model that incorporates inflow forecasts, based on DRL and the deterministic policy gradient algorithm. A multi-dimensional reward function is derived from the objective functions and constraints, and an optimal scheduling scheme is established with dynamically weighted reward functions. Observed daily flow data and 5-day inflow forecasts for the Three Gorges Reservoir (TGR) during flood seasons (10 June to 31 October) from 2010 to 2025 were used to evaluate model performance, and the results were compared with the actual operation. Compared with the actual operation, Scheme-1 with dynamic weights increases annual average flood prevention storage capacity by approximately 36.8%, enhances power generation by about 2.86 billion kW·h (≈5.49%), and reduces spillway waste water volume by around 3.33 billion m³. The study demonstrates that the optimal scheduling model can substantially improve the overall efficiency of reservoir operation, and that the improvement is even more pronounced when the reward function weights are set dynamically.
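The abstract does not give the reward function itself, so the following is only a minimal sketch of what a dynamically weighted, multi-component reward could look like. Everything in it is an assumption made for illustration: the component names, the normalisation to [0, 1], the linear weighting rule, the `flood_threshold` parameter, and the penalty factor are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class StepOutcome:
    """Normalised per-step quantities in [0, 1] (hypothetical names)."""
    flood_storage: float  # share of flood prevention storage kept free
    power: float          # generation relative to installed capacity
    spill: float          # spilled water relative to inflow
    violation: float      # size of any constraint violation, 0 if feasible


def dynamic_weights(forecast_inflows: list[float], flood_threshold: float) -> dict[str, float]:
    """Shift weight towards flood control as the forecast peak nears a threshold.

    Assumed rule: the weights are linear in a risk index and always sum to 1.
    """
    peak = max(forecast_inflows)
    risk = min(max(peak / flood_threshold, 0.0), 1.0)
    return {
        "flood": 0.2 + 0.6 * risk,
        "power": 0.6 - 0.5 * risk,
        "spill": 0.2 - 0.1 * risk,
    }


def reward(outcome: StepOutcome, weights: dict[str, float], penalty: float = 10.0) -> float:
    """Weighted sum of objective terms minus a constraint penalty (assumed form)."""
    objective = (
        weights["flood"] * outcome.flood_storage
        + weights["power"] * outcome.power
        - weights["spill"] * outcome.spill
    )
    return objective - penalty * outcome.violation


# Example: a 5-day forecast (m³/s, made-up values) with a peak above the threshold.
w = dynamic_weights([30000.0, 42000.0, 56000.0, 51000.0, 40000.0], flood_threshold=55000.0)
r = reward(StepOutcome(flood_storage=0.7, power=0.5, spill=0.1, violation=0.0), w)
```

In this sketch a high forecast peak raises the flood-control weight, so the agent is rewarded more for keeping storage free before a large inflow arrives. With a static weighting the same dictionary would simply be fixed for the whole season.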

Cite this study

Xiang et al. studied this question.

www.synapsesocial.com/papers/69e3209340886becb653f9fa — DOI: https://doi.org/10.3390/w18080948

Authors

Xin Xiang

Shenglian Guo

Bokai Sun

Journals

Water

Institutions

Wuhan University

Yangtze River Pharmaceutical Group (China)
