全球能源互联网2026,Vol.9Issue(1):60-71,12.DOI:10.19705/j.cnki.issn2096-5125.20240492
面向强不确定性供需波动的新能源电网调度强化学习算法
A Reinforcement Learning Algorithm for New Energy Grid Dispatch with Strong Uncertainty in Supply and Demand Fluctuations
摘要
Abstract
As the new energy industry gradually scales up,dynamic economic dispatch in power systems aimed at enhancing new energy absorption capacity is becoming increasingly crucial.This paper proposes a distributed multi-agent reinforcement learning algorithm to address the dynamic economic dispatch problem in systems with high proportions of new energy.First,the total power demand of the grid at each time step is obtained through an average consensus algorithm,and a feasible power output that meets coupling constraints and fully absorbs new energy is determined using a projection optimization method.Additionally,by employing a quadratic function to approximate the state-action value function of the evaluation network,another near-optimal solution for dispatch is derived by solving a convex optimization problem.Subsequently,an action network is constructed to directly learn the relationship between total power demand,real-time maximum available output of new energy units,and the active power output decisions of each thermal power unit.Leveraging experience from extensive training,the model swiftly predicts optimal output power in dynamic power systems,thus improving generator decision efficiency.Finally,the effectiveness and robustness of the proposed algorithm are verified through tests on the IEEE 39-bus system.关键词
智能电网/新能源消纳/动态经济调度/分布式强化学习Key words
smart grid/new energy absorption/dynamic economic dispatch/distributed reinforcement learning分类
信息技术与安全科学引用本文复制引用
周宁,徐铭铭,刘清秋,黄玉雄,陈明,李更丰..面向强不确定性供需波动的新能源电网调度强化学习算法[J].全球能源互联网,2026,9(1):60-71,12.基金项目
国家电网有限公司科技项目(521702230005/SGHADK00PJJS2400401). Science and Technology Foundation of SGCC(521702230005/SGHADK00PJJS2400401). (521702230005/SGHADK00PJJS2400401)