水资源与水工程学报2026,Vol.37Issue(1):150-159,10.DOI:10.11705/j.issn.1672-643X.2026.01.17
面向保供需求的深度强化学习水-风-光短期优化调度研究
Short-term optimal scheduling of hydro-wind-solar system based on deep reinforcement learning for power supply assurance demand:
摘要
Abstract
To balance the regulation capacity of hydropower stations and the supply reliability of hydro-wind-solar system,we proposed a short-term multi-objective hydro-wind-solar optimization dispatch model targeting supply assurance demand.Aiming at maximizing daily power generation and optimizing load profile matching,we employed SAC(soft actor-critic),a deep reinforcement learning algorithm,to solve the optimal scheduling model.The target curve feature decomposition method was introduced to de-couple the multiple objectives,and the hierarchical reward mechanism was designed to improve the strat-egy convergence performance.Taking the Pubugou,Shenxigou,Zhentouba(Pu-Shen-Zhen)cascade hydro-wind-solar system of the Dadu River Basin as a case study,we simulated the system operational states of typical days in flood season,normal season and dry season,under the scenarios of high and low output of wind and solar power.The simulation results were then compared with those of the conventional optimization algorithms,namely,progressive optimization algorithm(POA)and genetic algorithm(GA).The comparison shows that SAC exhibits excellent load tracking performance across all test scenari-os(with source-load correlation coefficient>99.9%),and achieves solutions with higher system power generation.It can achieve an average increase of 0.54%and 0.27%in cascade hydropower generation,compared with POA and GA.This result demonstrates that the maximum entropy exploration mechanism of SAC can effectively tap potential high-quality solutions,and balance the reliability of power supply as-surance and power generation,when the operational constraints are met.The proposed method can pro-vide a theoretical support and technical path for the intelligent optimal operation of hydro-wind-solar integrated systems.关键词
水-风-光一体化/电力保供/深度强化学习/SAC算法/短期优化调度/大渡河流域Key words
hydro-wind-solar integration/power supply assurance/deep reinforcement learning/soft actor-critic(SAC)/short-term optimal scheduling/the Dadu River Basin分类
信息技术与安全科学引用本文复制引用
王建华,易绍雯,朱燕梅,周玲,黄炜斌,马光文..面向保供需求的深度强化学习水-风-光短期优化调度研究[J].水资源与水工程学报,2026,37(1):150-159,10.基金项目
国家自然科学基金项目(U24B6011) (U24B6011)
国能大渡河流域水电开发有限公司技术服务项目(GJNY-DDH-SCZH-2025-002) (GJNY-DDH-SCZH-2025-002)