首页|期刊导航|电力工程技术|基于混合动作空间深度强化学习的电网潮流收敛性调整

基于混合动作空间深度强化学习的电网潮流收敛性调整

吴涛王昊昊李天然

电力工程技术2026，Vol.45Issue(5)：50-60,11.

电力工程技术2026，Vol.45Issue(5)：50-60,11.DOI:10.12158/j.2096-3203.2026.05.005

基于混合动作空间深度强化学习的电网潮流收敛性调整

A power flow convergence adjustment based on deep reinforcement learning with hybrid action space

吴涛 ¹王昊昊 ²李天然¹

作者信息

1. 南京师范大学南瑞电气与自动化学院,江苏南京 210023
2. 国网电力科学研究院有限公司(南瑞集团有限公司),江苏南京 211106
折叠

摘要

Abstract

As the operation modes of power systems become increasingly complex,the difficulty of power flow convergence adjustment also increases.Traditional methods that rely on human expertise suffer from delayed response and low efficiency,making them ill-suited for complex scenarios involving diverse and high-dimensional control variables.To address this challenge,a power flow adjustment method based on deep reinforcement learning with a hybrid action space is proposed.Firstly,a power flow convergence discriminator is developed by integrating physical priors with data-driven techniques to enable real-time identification of whether the power flow converges.The output convergence probability is used as a reward guidance signal in deep reinforcement learning.Then,a reinforcement learning environment for convergence adjustment is defined.The state space integrates system-level statistical features with node-level individual features.The action space encompasses both continuous and discrete control variables.And the reward function combines convergence identification results with feedback from the adjustment process to guide policy optimization toward the feasible region.Next,an Actor-Critic network with a hybrid action space is constructed,which hierarchically decouples and models subtasks including device selection,continuous regulation,and discrete control.Finally,simulation analysis and comparative experiments are conducted on the improved IEEE 39-bus and 118-bus systems.The results demonstrate that the power flow convergence discriminator enhanced with physical priors significantly improves both the accuracy and generalization capability of convergence identification compared to traditional models.Furthermore,the proposed coordinated optimization strategy,which integrates continuous-discrete actions,achieves notable improvements in power flow adjustment efficiency and convergence success rate over existing methods.

关键词

电网潮流/数据驱动/潮流收敛性鉴别器/混合动作空间/深度强化学习/分层解耦/Actor-Critic架构

Key words

power flow/data-driven/power flow convergence discriminator/hybrid action space/deep reinforcement learning/hierarchically decoupled/Actor-Critic architecture

分类

信息技术与安全科学

引用本文复制引用

吴涛,王昊昊,李天然..基于混合动作空间深度强化学习的电网潮流收敛性调整[J].电力工程技术,2026,45(5):50-60,11.

基金项目

智能电网国家科技重大专项资助项目(2024ZD0801103) （2024ZD0801103）

电力工程技术

ISSN：2096-3203

访问量0

下载量0

段落导航