| 注册
首页|期刊导航|电力建设|基于条件风险强化学习的梯级水光蓄联合优化调度

基于条件风险强化学习的梯级水光蓄联合优化调度

陈实 唐国登 刘艺洪 许刘超 朱钰杰 周毅 李华强 臧天磊

电力建设2025,Vol.46Issue(9):84-97,14.
电力建设2025,Vol.46Issue(9):84-97,14.DOI:10.12204/j.issn.1000-7229.2025.09.007

基于条件风险强化学习的梯级水光蓄联合优化调度

Condition-Based Risk Reinforcement Learning for Joint Optimal Scheduling of Cascade Hydropower and Solar-Powered Reservoirs

陈实 1唐国登 1刘艺洪 1许刘超 1朱钰杰 1周毅 1李华强 1臧天磊1

作者信息

  • 1. 四川大学电气工程学院,成都市 610065
  • 折叠

摘要

Abstract

[Objective]Owing to the advancement of the dual-carbon strategy,hydropower developers are actively promoting the pumped-storage retrofitting of hydroelectric facilities while accelerating the regional construction of photovoltaic(PV)and other renewable-energy sources,thus gradually forming a scenario with high-penetration renewables and large-scale pumped storage.However,under the dual uncertainties of renewable-energy output and interval water inflow,the system operational coordination becomes more complex,particularly because scheduling methods for cascade hydropower systems that consider interval inflow uncertainty are rarely investigated.[Methods]This study proposes a conditional risk-aware reinforcement-learning-based optimal scheduling method for cascade hydro-PV-pumped storage systems.First,the"Informer"deep neural network is employed to predict basin-interval water inflow,where inflow uncertainty is transformed into flexibility supply indicators.Subsequently,risk theory is integrated to quantify flexibility deficits using the conditional-value-at-risk measure.Finally,an improved risk-managing proximal policy optimization(RM-PPO)reinforcement-learning algorithm is developed to derive optimized scheduling strategies.[Results]Validation using an actual cascade hydro-PV-pumped storage base in China shows that the proposed forecasting method reduces the MSE and MAE by 18.9%and 58.8%,respectively,compared with conventional time-series approaches,with a 78.5%capture rate for peak events.The RM-PPO scheduling algorithm achieves flexible system regulation through pumped-storage plants coordinated with PVs for surplus-energy absorption,whereas conventional hydropower stations synergize with PVs for source-load dynamic matching.[Conclusions]The RM-PPO-based scheduling strategy effectively reduces costs and controls operational risks while maintaining system flexibility,thus promoting renewable-energy accommodation and enhancing hydropower-utilization efficiency.

关键词

梯级水光蓄/条件风险/强化学习/调度优化

Key words

cascade hydropower-photovoltaic storage/conditional risk/reinforcement learning/scheduling optimization

分类

信息技术与安全科学

引用本文复制引用

陈实,唐国登,刘艺洪,许刘超,朱钰杰,周毅,李华强,臧天磊..基于条件风险强化学习的梯级水光蓄联合优化调度[J].电力建设,2025,46(9):84-97,14.

基金项目

国家自然科学基金资助项目(52377115)This work is supported by the National Natural Science Foundation of China(NSFC)(No.52377115). (52377115)

电力建设

OA北大核心

1000-7229

访问量0
|
下载量0
段落导航相关论文