首页|期刊导航|电力工程技术|基于多智能体深度强化学习的多区域负荷频率协同控制

基于多智能体深度强化学习的多区域负荷频率协同控制

许庆禹何宇张靖沈涛齐岳赵健

电力工程技术2026，Vol.45Issue(5)：69-80,12.

电力工程技术2026，Vol.45Issue(5)：69-80,12.DOI:10.12158/j.2096-3203.2026.05.007

基于多智能体深度强化学习的多区域负荷频率协同控制

Multi-area load frequency cooperative control based on multi-agent deep reinforcement learning

许庆禹 ¹何宇 ²张靖 ²沈涛 ¹齐岳 ³赵健³

作者信息

1. 贵州大学电气工程学院,贵州贵阳 550025
2. 贵州大学电气工程学院,贵州贵阳 550025||贵州省电力系统智能化技术重点实验室,贵州贵阳 550025
3. 中国电建集团贵州电力设计研究院有限公司,贵州贵阳 550002
折叠

摘要

Abstract

With the integration of large-scale renewable energy,the frequency stability of power systems has been subjected to severe challenges.In this study,an expert-prefilled multi-agent twin delayed deep deterministic policy gradient(EP-MATD3)algorithm based on an expert data pre-filling mechanism is proposed for multi-area load frequency control.Firstly,a multi-area frequency response model including thermal power units,wind turbines,photovoltaic systems,and energy storage systems is first established.Based on the traditional multi-area tie-line power model,coordinated control between regional controllers is incorporated,by which the interconnection between regions is strengthened and unplanned power exchanges are reduced.Then,the multi-agent twin delayed deep deterministic policy gradient(MATD3)algorithm is adopted to mitigate the Q-value overestimation problem inherent in traditional reinforcement learning,and stability and convergence of the control policy are enhanced.Furthermore,within the collaborative control framework of centralized training and decentralized execution,an expert data pre-filling mechanism is introduced during the centralized training stage,whereby the occurrence of invalid actions during random exploration is limited and the convergence of agent training is accelerated.During the decentralized execution stage,unit power outputs are independently adjusted by the trained agents according to the real-time system states of their respective regions,enabling effective suppression of frequency fluctuations.Through simulation on a three-area power system,it is demonstrated that,compared with traditional methods,the proposed EP-MATD3 control strategy achieves a significant reduction in training time and effectively decreases system frequency deviations under continuous step-load and photovoltaic fluctuation disturbances,thereby verifying its effectiveness and superiority in the frequency control of complex power systems.

关键词

多区域负荷频率控制/协同控制/区域控制误差/多智能体深度强化学习/双延迟深度确定性策略梯度算法/预填充机制

Key words

multi-area load frequency control/cooperative control/area control error/multi-agent deep reinforcement learning/twin delay deep deterministic policy gradient algorithm/pre-filling mechanism

分类

信息技术与安全科学

引用本文复制引用

许庆禹,何宇,张靖,沈涛,齐岳,赵健..基于多智能体深度强化学习的多区域负荷频率协同控制[J].电力工程技术,2026,45(5):69-80,12.

基金项目

国家自然科学基金资助项目(52406227) （52406227）

贵州省科技支撑项目(黔科合成果LH[2025]重点014) （黔科合成果LH[2025]重点014）

电力工程技术

ISSN：2096-3203

访问量0

下载量0

段落导航