首页|期刊导航|中国电机工程学报|考虑参数共享的深度强化学习双阶段分布式电源优化

考虑参数共享的深度强化学习双阶段分布式电源优化

高放姚浩天高庆殷林飞蔡运翔金岩潘宇

中国电机工程学报2025，Vol.45Issue(19)：7493-7509,中插19,18.

中国电机工程学报2025，Vol.45Issue(19)：7493-7509,中插19,18.DOI:10.13334/j.0258-8013.pcsee.240635

考虑参数共享的深度强化学习双阶段分布式电源优化

Two-stage Distributed Generators Optimization Based on Deep Reinforcement Learning With Parameter Sharing

高放 ¹姚浩天 ¹高庆 ²殷林飞 ¹蔡运翔 ¹金岩 ¹潘宇³

作者信息

1. 广西大学电气工程学院,广西壮族自治区南宁市 530004
2. 北京航空航天大学杭州创新研究院,浙江省杭州市 311100
3. 工业控制技术国家重点实验室(浙江大学),浙江省杭州市 310027
折叠

摘要

Abstract

As renewable energy sources such as solar and wind are increasingly integrated into the grid at high proportions,the optimization and scheduling of distributed power generation face challenges due to frequent changes in system topology,affecting the stability and economic operation of the distribution network.Existing methods,designed for systems with fixed topology,rely on precise models and are time-consuming,making real-time control difficult.Current deep reinforcement learning approaches struggle to balance distributed training and mixed discrete-continuous action spaces.This study introduces a distributed power optimization strategy based on multi-agent deep reinforcement learning with two stages and parameter sharing.Initially,the problem is vertically decoupled,constructing a dynamic distribution network reconfiguration model with distributed generation using mixed-integer second-order cone programming to determine the topology.Subsequently,the distribution network environment is horizontally decoupled into several regions.In the second stage,a centralized training with decentralized execution framework that incorporates parameter sharing is proposed.This framework incorporates a multi-agent prioritized double-delay deep deterministic policy gradient algorithm with a priority experience replay mechanism.Topology information is embedded into the distribution network environment,mapped to agents through power flow calculations to minimize network active power loss in the optimization scheduling model.Case studies demonstrate that the proposed algorithm,by considering changes in the distribution network topology and enhancing learning efficiency through strategy and experience sharing among agents,as well as priority experience replay,meets the efficiency requirements of real-time online decision-making and shows superior voltage stability and loss reduction performance compared to other strategies.

关键词

深度强化学习/参数共享/动态重构/拓扑变化/分布式电源优化调度

Key words

deep reinforcement learning/parameter sharing/dynamic reconfiguration/topological changes/distributed generation optimization scheduling

分类

动力与电气工程

引用本文复制引用

高放,姚浩天,高庆,殷林飞,蔡运翔,金岩,潘宇..考虑参数共享的深度强化学习双阶段分布式电源优化[J].中国电机工程学报,2025,45(19):7493-7509,中插19,18.

基金项目

国家重点研发计划项目(2022YFB3304700) （2022YFB3304700）

国家自然科学基金项目(62463001) （62463001）

工业控制技术全国重点实验室开放课题(ICT2024B21).National Key R&D Program of China(2022YFB3304700) （ICT2024B21）

The Project Supported by National Natural Science Foundation of China(62463001) （62463001）

The Open Research Project of the State Key Laboratory of Industrial Control Technology(China)(ICT2024B21). （China）

中国电机工程学报

OA北大核心

ISSN：0258-8013

访问量0

下载量0

段落导航